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IMPROVED POLYAMIDES FOR BINDING IN THE MINOR 
GROOVE OF DOUBLE STRANDED DNA 

5 The U.S. Government has certain rights in this invention pursuant to Grant Nos. GM 

26453, 27681 and 47530 awarded by the National Institute of Health. 

CROSS REFERENCE TO RELATED APPLICATIONS 

10 This application is a continuation-in-part of PCT/US97/03332 filed February 20, 1997, 

Serial No. 08/853,522 filed May 8, 1997 and PCT/US 97/12722 filed July 21, 1997 which are 
continuation-in-part applications of Serial No. 08/837,524, filed April 21, 1997, Serial No. 
08/607,078, filed February 26, 1996, provisional application Serial No. 60/042,022, filed April 
16, 1997 and provisional application Serial No. 60/043,444, filed April 8, 1997. 

15 

BACKGROUND OF THE INVENTION 

Field of the Invention 

20 This invention relates to polyamides which bind to predetermined sequences in the 

minor groove of double stranded DNA. 

Description of the Related Art 

25 The design of synthetic ligands that read the information stored in the DNA double helix 

has been a long standing goal of chemistry. Cell-permeable small molecules which target 
predetermined DNA sequences are useful for the regulation of gene-expression. 
Oligodeoxynucleotides that recognize the major groove of double-helical DNA via triple-helix 
formation bind to a broad range of sequences with high affinity and specificity. Although 

30 oligonucleotides and their analogs have been shown to interfere with gene expression, the triple 
helix approach is limited to purine tracks and suffers from poor cellular uptake. The 
development of pairing rules for minor groove binding polyamides derived from N- 
methylpyrrole (Py) and N-methylimidazole (Im) amino acids provides another code to control 
sequence specificity. An Im/Py pair distinguishes G»C from C*G and both of these from A«T 

35 or T«A base pairs. Wade, W.S., Mrksich, M. & Dervan, P.B. describes the design of peptides 
that bind in the minor groove of DNA at 5 , -(A,T)G(A,T)C(A,T)-3' sequences by a dimeric 
side-by-side motif. 1 Am. Chem. Soc. 114, 8783-8794 (1992); Mrksich, M. et al describes 
antiparallel side-by-side motif for sequence specific-recognition in the minor groove of DNA by 
the designed peptide l-methylimidazole-2-carboxamidenetropsin. Proc. Natl. Acad. Sci. USA 

40 89, 7586-7590 (1992); Trauger, J.W., Baird, E. E. Dervan, P.B. describes the recognition of 
DNA by designed ligands at subnanomolar concentrations. Nature 382, 559-561 (1996). A 
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Py/Py pair specifies A«T from G«C but does not distinguish A#T from T^A. Pelton, J.G. & 
Wemmer, D.E. describes the structural characterization of a 2-1 distamycin A- 
d(CGCAAATTTGGC) complex by two-dimensional NMR. Proc. Natl Acad. Sci. USA 86, 
5723-5727 (1989); White, S., Baird, E. E. & Dervan, P.B. Describes the effects of the A*T/T#A 

5 degeneracy of pyrrole-imidazole polyamide recognition in the minor groove of DNA. 
Biochemistry 35, 6147-6152 (1996); White, S., Baird, E. E. & Dervan, P. B. describes the 
pairing rules for recognition in the minor groove of DNA by pyrrole-imidazole polyamides. 
Chem. & Biol. 4, 569-578 (1997); White, S., Baird, E. E. & Dervan, P.B. describes the 5'-3* N- 
C orientation preference for polyamide binding in the minor groove. In order to break this 

10 degeneracy, a new aromatic amino acid, 3-hydroxy-N-methylpyrTole (Hp) incorporated into a 
polyamide and paired opposite Py, has been found to discriminate A«T from T*A. The 
replacement of a single hydrogen atom on the pyrrole with a hydroxy group in a Hp/Py pair 
regulates affinity and specificity of a polyamide by an order of magnitude. Utilizing Hp 
together with Py and Im in polyamides to form four aromatic amino acid pairs (Im/Py, Py/Im, 

15 Hp/Py, and Py/Hp) provides a code to distinguish all four Watson-Crick base pairs in the minor 
groove of DNA. 

SUMMARY OF THE INVENTION 

2 <> The invention encompasses improved polyamides for binding to the minor groove of 

double stranded ("duplex") DNA. The polyamides are in the form of a hairpin comprising two 
groups of at least three consecutive carboxamide residues, the two groups covalently linked by 
an aliphatic amino acid residue, preferably y-aminobutyric acid or 2,4 diaminobutyric acid, the 
consecutive carboxamide residues of the first group pairing in an antiparallel manner with the 

25 consecutive carboxamide residues of the second group in the minor groove of double stranded 
DNA. The improvement relates to the inclusion of a binding pair of Hp/Py carboxamides in the 
polyamide to bind to a T»A base pair in the minor groove of double stranded DNA or Py/Hp 
carboxamide binding pair in the polyamide to bind to an A*T base pair in the minor groove of 
double stranded DNA. The improved polyamides have at least three consecutive carboxamide 

30 pairs for binding to at least three DNA base pairs in the minor groove of a duplex DNA 
sequence that has at least one A»T or T»A DNA base pair, the improvement comprising 
selecting a Hp/Py carboxamide pair to correspond to a ^^A base pair in the minor groove or a 
Py/Hp carboxamide pair to bind to an A«T DNA base pair in the minor groove. Preferably the 
binding of the carboxamide pairs to the DNA base pairs modulates the expression of a gene. 

In one preferred embodiment, the polyamide includes at least four consecutive 
carboxamide pairs for binding to at least four base pairs in a duplex DNA sequence. In another 
preferred embodiment, the polyamide includes at least five consecutive carboxamide pairs for 
binding to at least five base pairs in a duplex DNA sequence. In yet another preferred 
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embodiment, the polyamide includes at least six consecutive carboxamide pairs for binding to at 
least six base pairs in a duplex DNA sequence. In one preferred embodiment, the improved 
polyamides have four carboxamide binding pairs that will distinguish A«T, T»A, OG and G*C 
base pairs in the minor groove of a duplex DNA sequence. The duplex DNA sequence can be a 
5 regulatory sequence, such as a promoter sequence or an enhancer sequence, or a gene sequence, 
such as a coding sequence or a non-coding sequence. Preferably, the duplex DNA sequence is a 
promoter sequence. 

The preparation and the use of polyamides for binding in the minor groove of double 
10 stranded DNA are extensively described in the art. This invention is an improvement of the 
existing technology that uses 3-hydroxy-N-methylpyrrole to provide carboxamide binding pairs 
for DNA binding polyamides. 

The invention encompasses polyamides having y-aminobutyric acid or a substituted y- 
15 aminobutyric acid to form a hairpin with a member of each carboxamide pairing on each side of 
it. Preferably the substituted y-aminobutyric acid is a chiral substituted y-aminobutyric acid 
such as (R)-2,4-diaminobutyric acid. In addition, the polyamides may contain an aliphatic 
amino acid residue, preferably a P-alanine residue, in place of a non-Hp carboxamide. The p- 
alanine residue is represented in formulas as p. The P-alanine residue becomes a member of a 
20 carboxamide binding pair. The invention further includes the substitution as a p«p binding pair 
for non-Hp containing binding pair. Thus, binding pairs in addition to the Hp/Py and Py/Hp are 
Im/p, p/Im, Py/p, p/Py, and p/p. 

The polyamides of the invention can have additional moieties attached covalently to the 
25 polyamide. Preferably the additional moieties are attached as substituents at the.amino terminus 
of the polyamide, the carboxy terminus of the polyamide, or at a chiral (R)-2,4-diaminobutyric 
acid residue. Suitable additional moieties include a detectable labeling group such as a dye, 
biotin or a hapten. Other suitable additional moieties are DNA reactive moieties that provide 
for sequence specific cleavage of the duplex DNA. 

30 

Brief Description of the Drawings 

Figure 1 illustrates the structure of polyamide 1^ 2, and 3. 
Figure 2 illustrates the pairing of polyamides to DNA base pairs. 
35 Figure 3 illustrates the DNase footprint titration of compounds 2 and 3. 

Figure 4 illustrates a list of the structures of representative Hp containing polyamides. 
Figure 5 illustrates the synthesis of a protected Hp monomer for solid phase synthesis. 
Figure 6 illustrates the solid phase synthesis of polyamide 2. 
Figure 7 illustrates the 1H-NMR characterization of polyamide 2. 
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Figure 8 illustrates the Mass spectral characterization of polyamide 2. 
Figure.9 illustrates 1H-NMR characterization of synthesis purity. 
Figure 10 illustrates DNasel footprint titration experiment. 
Figure 1 1 illustrates the synthesis of bifunctional conjugate of polyamide 2. 
5 Figure 1 2 illustrates affinity cleaving evidence for oriented hairpin formation. 

Figure 13 illustrates increased sequence specificity of Hp/Py containing polyamides. 
Figure 14 illustrates 8-ring hairpin polyamides which target 5'-WGTNNW-3' sites. 
Figure 15 illustrates 8-ring hairpin polyamides which target 5'-WGANNW-3' sites. 
Figure 16 illustrates 8-ring hairpin polyamides which target S'-WGGNNW-S' sites. 
1 0 Figure 1 7 illustrates 8-ring hairpin polyamides which target 5 '- WGCNNW-3 ' sites. 

DETAILED D ESCRIPTION OF THE INVENTION 

Within this application, unless otherwise stated, definitions of the terms and illustration 
15 of the techniques of this application may be found in any of several well-known references such 
as: Sambrook, J., et ai, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 
Laboratory Press (1989); Goeddel, D., ed., Gene Expression Technology, Methods in 
Enzymology, 185, Academic Press, San Diego, CA (1991); "Guide to Protein Purification" in 
Deutshcer, M.P., ed., Methods in Enzymology, Academic Press, San Diego, CA (1989); Innis, et 
20 al., PCR Protocols: A Guide to Methods and Applications, Academic Press, San Diego, CA 
(1990); Freshney, R.I., Culture of Animal Cells: A Manual of Basic Technique, 2 nd Ed., Alan 
Liss, Inc. New York, NY (1987); Murray, E.J., ed.. Gene Transfer and Expression Protocols, 
pp. 109-128, The Humana Press Inc., Clifton, NJ and Lewin, B., Genes VI, Oxford University 
Press, New York (1997). 

25 

For the purposes of this application, a promoter is a regulatory sequence of DNA that is 
involved in the binding of RNA polymerase to initiate transcription of a gene. A gene is a 
segment of DNA involved in producing a peptide, polypeptide or protein, including the coding 
region, non-coding regions preceding ("leader") and following ("trailer") the coding region, as 

30 well as intervening non-coding sequences ("introns") between individual coding segments 
("exons"). Coding refers to the representation of amino acids, start and stop signals in a three 
base "triplet" code. Promoters are often upstream (" '5 to") the transcription initiation site of 
the corresponding gene. Other regulatory sequences of DNA in addition to promoters are 
known, including sequences involved with the binding of transcription factors, including 

35 response elements that are the DNA sequences bound by inducible factors. Enhancers comprise 
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yet another group of regulatory sequences of DNA that can increase the utilization of 
promoters, and can function in either orientation (5 y -3* or 3'-5') and in any location (upstream 
or downstream) relative to the promoter. Preferably, the regulatory sequence has a positive 
activity, i.e., binding of an endogeneous ligand (e.g. a transcription factor) to the regulatory 
5 sequence increases transcription, thereby resulting in increased expression of the corresponding 
target gene. In such a case, interference with transcription by binding a polyamide to a 
regulatory sequence would reduce or abolish expression of a gene. 

The promoter may also include or be adjacent to a regulatory sequence known in the art 
10 as a silencer. A silencer sequence generally has a negative regulatory effect on expression of 
the gene. In such a case, expression of a gene may be increased directly by using a polyamide 
to prevent binding of a factor to a silencer regulatory sequence or indirectly, by using a 
polyamide to block transcription of a factor to a silencer regulatory sequence. 

15 It is to be understood that the polyamides of this invention bind to double stranded DNA 

in a sequence specific manner. The function of a segment of DNA of a given sequence, such as 
5'-TATAAA-3\ depends on its position relative to other functional regions in the DNA 
sequence. In this case, if the sequence S'-TATAAA-^ on the coding strand of DNA is 
positioned about 30 base pairs upstream of the transcription start site, the sequence forms part 

20 of the promoter region (Lewin, Genes VI pp. 831-835). On the other hand, if the sequence 5'- 
TATAAA-3' is downstream of the transcription start site in a coding region and in proper 
register with the reading frame, the sequence encodes the tyrosyl and lysyl amino acid residues 
(Lewin, Genes VI pp. 213-215). 

25 

While not being held to one hypothesis, it is believed that the binding of the polyamides 
of this invention modulate gene expression by altering the binding of DNA binding proteins, 
such as RNA polymerase, transcription factors, TBF, TFIIIB and other proteins. The effect on 
gene expression of polyamide binding to a segment of double stranded DNA is believed to be 
30 related to the function, e.g., promoter, of that segment of DNA. 

It is to be understood by one skilled in the art that the improved polyamides of the 
present invention may bind to any of the above-described DNA sequences or any other 
sequence having a desired effect upon expression of a gene. In addition, U.S. Patent No. 
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5,578,444 describes numerous promoter targeting sequences from which base pair sequences 
for targeting an improved polyamide of the present invention may be identified. 

It is generally understood by those skilled in the art that the basic structure of DNA in a 
5 living cell includes both major and a minor groove. For the purposes of describing the present 
invention, the minor groove is the narrow groove of DNA as illustrated in common molecular 
biology references such as Lewin, B., Genes VI, Oxford University Press, New York (1997). 

To affect gene expression in a cell, which may include causing an increase or a decrease 
10 in gene expression, a effective quantity of one or more polyamide is contacted with the cell and 
internalized by the cell. The cell may be contacted in vivo or in vitro. Effective extracellular 
concentrations of polyamides that can modulate gene expression range from about 10 
nanomolar to about 1 micromolar. Gottesfeld, J.M., et ai, Nature 387 202-205 (1997). To 
determine effective amounts and concentrations of polyamides in vitro, a suitable number of 
15 cells is plated on tissue culture plates and various quantities of one or more polyamide are 
added to separate wells. Gene expression following exposure to a polyamide can be monitored 
in the cells or medium by detecting the amount of the protein gene product present as 
determined by various techniques utilizing specific antibodies, including ELISA and western 
blot. Alternatively, gene expression following exposure to a polyamide can be monitored by 
20 detecting the amount of messenger RNA present as determined by various techniques, including 
northern blot and RT-PCR. 



Similarly, to determine effective amounts and concentrations of polyamides for in vivo 
administration, a sample of body tissue or fluid, such as plasma, blood, urine, cerebrospinal 

25 fluid, saliva, or biopsy of skin, muscle, liver, brain or other appropriate tissue source is 
analyzed. Gene expression following exposure to a polyamide can be monitored by detecting 
the amount of the protein gene product present as determined by various techniques utilizing 
specific antibodies, including ELISA and western blot. Alternatively, gene expression 
following exposure to a polyamide can be monitored by the detecting the amount of messenger 

30 RNA present as determined by various techniques, including northern blot and RT-PCR. 

The polyamides of this invention may be formulated into diagnostic and therapeutic 
compositions for in vivo or in vitro use. Representative methods of formulation may be found 
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in Remington: The Science and Practice of Pharmacy, 19th ed., Mack Publishing Co., Easton, 
PA (1995). 

For in vivo use, the polyamides may be incorporated into a physiologically acceptable 
5 pharmaceutical composition that is administered to a patient in need of treatment or an animal 
for medical or research purposes. The polyamide composition comprises pharmaceutical^ 
acceptable carriers, excipients, adjuvants, stabilizers, and vehicles. The composition may be in 
solid, liquid, gel, or aerosol form. The polyamide composition of the present invention may be 
administered in various dosage forms orally, parentally, by inhalation spray, rectally, or 
10 topically. The term parenteral as used herein includes, subcutaneous, intravenous, 
intramuscular, intrastemal, infusion techniques or intraperitoneally. 

The selection of the precise concentration, composition, and delivery regimen is 
influenced by, inter alia, the specific pharmacological properties of the particular selected 
15 compound, the intended use, the nature and severity of the condition being treated or diagnosed, 
the age, weight, gender, physical condition and mental acuity of the intended recipient as well 
as the route of administration. Such considerations are within the purview of the skilled artisan. 
Thus, the dosage regimen may vary widely, but can be determined routinely using standard 
methods. 

20 

Polyamides of the present invention are also useful for detecting the presence of double 
stranded DNA of a specific sequence for diagnostic or preparative purposes. The sample 
containing the double stranded DNA can be contacted by polyamide linked to a solid substrate, 
thereby isolating DNA comprising a desired sequence. Alternatively, polyamides linked to a 
25 suitable detectable marker, such as biotin, a hapten, a radioisotope or a dye molecule, can be 
contacted by a sample containing double stranded DNA. 

The design of bifunctional sequence specific DNA binding molecules requires the 
integration of two separate entities: recognition and functional activity. Polyamides that 
30 specifically bind with subnanomolar affinity to the minor groove of a predetermined sequence 
of double stranded DNA are linked to a functional molecule, providing the corresponding 
bifunctional conjugates useful in molecular biology, genomic sequencing, and human medicine. 
Polyamides of this invention can be conjugated to a variety of functional molecules, which can 
be independently chosen from but is not limited to arylboronic acids, biotins, polyhistidines 
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comprised from about 2 to 8 amino acids, haptens to which an antibody binds, solid phase 
supports, oligodeoxynucleotides, N-ethylnitrosourea, fluorescein, bromoacetamide, 
iodoacetamide, DL-a-lipoic acid, acridine, captothesin, pyrene, mitomycin, texas red, 
anthracene, anthrinilic acid, avidin, DAPI, isosulfan blue, malachite green, psoralen, ethyl red, 
4-(psoraen-8-yloxy)-butyrate, tartaric acid, (+)-a-tocopheral, psoralen, EDTA, methidium, 
acridine, Ni(II>Gly-Gly-His, TO, Dansyl, pyrene, N-bromoacetamide, and gold particles. Such 
bifunctional polyamides are useful for DNA affinity capture, covalent DNA modification, 
oxidative DNA cleavage, DNA photocleavage. Such bifunctional polyamides are useful for 
DNA detection by providing a polyamide linked to a detectable label. Detailed instructions for 
synthesis of such bifunctional polyamides can be found in copending U.S. provisional 
application 60/043,444, the teachings of which are incorporated by reference. 



DNA complexed to a labeled polyamide can then be determined using the appropriate 
detection system as is well known to one skilled in the art. For example, DNA associated with 
a polyamide linked to biotin can be detected by a streptavidin / alkaline phosphatase system. 

The present invention also describes a diagnostic system, preferably in kit form, for 
assaying for the presence of the double stranded DNA sequence bound by the polyamide of this 
invention in a body sample, such brain tissue, cell suspensions or tissue sections, or body fluid 
samples such as CSF, blood, plasma or serum, where it is desirable to detect the presence, and 
preferably the amount, of the double stranded DNA sequence bound by the polyamide in the 
sample according to the diagnostic methods described herein. 



The diagnostic system includes, in an amount sufficient to perform at least one 
25 assay, a specific polyamide as a separately packaged reagent. Instructions for use of the 
packaged reagent(s) are also typically included. As used herein, the term "package" refers 
to a solid matrix or material such as glass, plastic (e.g., polyethylene, polypropylene or 
polycarbonate), paper, foil and the like capable of holding within fixed limits a polyamide of 
the present invention. Thus, for example, a package can be a glass vial used to contain 
30 milligram quantities of a contemplated polyamide or it can be a microliter plate well to which 
microgram quantities of a contemplated polypamide have been operatively affixed, i.e., linked 
so as to be capable of being bound by the target DNA sequence. "Instructions for use" typically 
include a tangible expression describing the reagent concentration or at least one assay method 
parameter such as the relative amounts of reagent and sample to be admixed, maintenance time 
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periods for reagent or sample admixtures, temperature, buffer conditions and the like. A 
diagnostic system of the present invention preferably also includes a detectable label and a 
detecting or indicating means capable of signaling the binding of the contemplated polyamide 
of the present invention to the target DNA sequence. As noted above, numerous detectable 
5 labels, such as biotin,~and detecting or indicating means, such as enzyme-linked (direct or 
indirect) streptavidin, are well known in the art. 

Figure 1 shows representative structures of polyamides. ImlmPyPy-y-ImPyPyPy-p-Dp 
(1), ImlmPyPy-y-ImHpPyPy-p-Dp (2), and ImlmHpPy-y-ImPyPyPy-p-Dp (3). (Hp = 3- 

10 hydroxy-N-methylpyrrole, Im = N-methylimidazole, Py = N-methylpyrrole, p = p-alanine, y = 
y-aminobutyric acid, Dp = Dimethylaminopropylamide). Polyamides were synthesized by solid 
phase methods using Boc -protected 3-methoxypyrrole. imidazole, and pyrrole aromatic amino 
acids, cleaved from the support by aminolysis, deprotected with sodium thiophenoxide, and 
purified by reversed phase HPLC. Baird, E. E. & Dervan, P. B. describes the solid phase 

15 synthesis of polyamides containing imidazole and pyrrole amino acids. 1 Am. Chem. Soc. 118, 
6141 -6146 (1996); also see PCT US 97/003332. The identity and purity of the polyamides 
were verified by l H NMR, analytical HPLC, and matrix-assisted laser-desorption ionization 
time-of-flight mass spectrometry (MALDI-TOF MS-monoisotopic): 1 1223.6 (1223.6 
calculated), 2 1239.6 (1239.6 calculated); 3 1239.6 (1239.6 calculated). 

20 

Figure 2 illustrates binding models for polyamides 1-3 in complex with 5 , -TGGTCA-3' 
and 5'-TGGACA-3' (A^T and T«A in fourth position highlighted). Filled and unfilled circles 
represent imidazole and pyrrole rings respectively; circles containing an H represent 3- 
hydroxypyrrole, the curved line connecting the polyamide subunits represents y-aminobutyric 
25 acid, the diamond represents p-alanine, and the + represents the positively charged 
dimethylaminopropylamide tail group. 

Figure 3 shows quantitative DNase I footprint titration experiments with polyamides 2 
and 3 on the 3' 32 P labeled 250-bp pJK6 £coRI/PvwII restriction fragment. Lane 1, intact DNA; 
30 lanes 2-11 DNase I digestion products in the presence of 100, 50, 20, 10, 5, 2, 1, 0.5, 0.2, 0.1 
nM polyamide, respectively; lane 12, DNase I digestion products in the absence of polyamide; 
lane 13, adenine-specific chemical sequencing. Iverson, B. L. & Dervan, P. B. describes an 
adenine-specific DNA chemical sequencing reaction. Methods EnzymoL 15, 7823-7830 (1987). 
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All reactions were done in a total volume of 400 ^L. A polyamide stock solution or H 2 0 was 
added to an assay buffer containing radiolabeled restriction fragment, with the final solution 
conditions of 10 mM Tris-HCl, 10 mM KCl, 10 mM MgCl 2 , 5 mM CaCl 2> pH 7.0. Solutions 
were allowed to equilibrate for 4-12 h at 22 °C before initiation of footprinting reactions. 
5 Footprinting reactions," separation of cleavage products, and data analysis were carried out as 
described. White, S., Baird, E. E. & Dervan, P. B. Effects of the A*T/T*A degeneracy of 
pyrrole-imidazole polyamide recognition in the minor groove of DNA. Biochemistry 55, 6147- 
6152(1996). 

10 Figure 4 shows the structure and equilibrium dissociation constant for numerous 

compounds of the present invention. Polyamides are shown in complex with their respective 
match site. Filled and unfilled circles represent imidazole (Im) and pyrrole (Py) rings, 
respectively; circles containing an H represent 3-hydroxypyrroIe (Hp), the curved line 
connecting the polyamide subunits represents y-aminobutyric acid (y), the diamond represents 

15 p-alanine (P), and the + represents the positively charged dimethylaminopropylamide tail group 
(Dp). The equilibrium dissociation constants are the average values obtained from three DNase 
I footprint titration experiments. The standard deviation for each set is less than 15% of the 
reported number. Assays were carried out in the presence of 10 mM Tris*HCl, 10 mM KCl, 10 
mM MgCl 2 , and 5 mM CaCl 2 at pH 7.0 and 22°C. 

20 

Figure 5 shows the synthetic scheme for 3-O-methyl-N-Boc protected pyrrole-2- 
carboxylate. The hydroxypyrrole monoester can be prepared in 0.5 kg quantity using published 
procedures on enlarged scale. 

25 Figure 6 shows the solid phase synthetic scheme for ImlmPyPy-y-ImHpPyPy-P-Dp 

starting from commercially available Boc-p-Pam-Resin: (i) 80% TFA/DCM, 0.4 M PhSH; (ii) 
Boc-Py-OBt, DIEA, DMF; (iii) 80% TFA/DCM, 0.4 M PhSH; (iv) Boc-Py-OBt, DIEA, DMF; 
(v) 80% TFA/DCM, 0.4 M PhSH; (vi) Boc-3-OMe-Py-OH, HBTU, DMF, DIEA; (vii) 80% 
TFA/DCM, 0.4 M PhSH; (viii) Boc-Im-OH, DCC, HOBt; (ix) 80% TFA/DCM, 0.4 M PhSH; 

30 (x) Boc-y-aminobutyric acid, DIEA, DMF; (xi) 80% TFA/DCM, 0.4 M PhSH; (xii) Boc-Py- 
OBt, DIEA, DMF; (xiii) 80% TFA/DCM, 0.4 M PhSH; (xiv) Boc-Py-OBt, DMF, DIEA; (xv) 
80% TFA/DCM, 0.4 M PhSH; (vxi) Boc-Im-OH, DCC, HOBt (xvii) 80% TFA/DCM, 0.4 M 
PhSH; (xviii) imidazole-2-carboxylic acid, HBTU, DIEA; (xviv) dimethylaminopropylamine, 
55 °C, 18h. Purification by reversed phase HPLC provides ImlmPyPy-y-ImOpPyPy-p-Dp. (Op 

35 = 3-methoxypyrrole). Treatment of the 3-methyoxypyrrole polyamide with thiophenol, NaH, 
DMF, at 100 °C for 120 min provides polyamide 2 after reverse phase HPLC purification. 
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Figure 7 shows the aromatic region from 7-11 ppm for the 1H-NMR spectrum 
determined at 300 MHz for ImlmPyPy-y-ImOpPyPy-p-Dp and ImlmPyPy-y-ImHpPyPy-p-Dp. 
This region of the spectrum may be used to determine compound identity and purity. 

5 Figure 8 shows-the MALDI-TOF mass spectrum determined in positive ion mode with a 

monoisotopic detector for the polyamides for ImlmPyPy-y-ImOpPyPy-P-Dp and ImlmPyPy-y- 
ImHpPyPy-P-Dp. This spectrum may be used to determine compound identity and purity. 

Figure 9 shows the methyl group region from 3.5-4.0 ppm for the 1H-NMR spectrum 
10 determined at 300 MHz for ImPyPy-y-OpPyPy-P-Dp and ImPyPy-y-HpPyPy-P-Dp. This region 
of the spectrum may be used to directly follow the progress for conversion of 3-methoxypyrrole 
to 3-hydroxypyrrole. 

Fig. 10 shows quantitative DNase I footprint titration experiments with the polyamides 

!5 ImPyPy-y-PyHpPy-p-Dp and ImHpPy-y-PyPyPy-p-Dp on the 3'- 32 P labeled 370-bp pDEHl 
EccRUPvuW restriction fragment. Intact lane, labeled restriction fragment no polyamide or 
DNase I added; lanes 1-10, DNase I digestion products in the presence of 10 fiM, 5 |uM, 2 ^M, 
1 nM, 500 nM, 200 nM, 100 nM, 50 nM, 20 nM, 10 nM ImPyPy-y-PyPyPy-p-Dp, respectively 
or 1 \lM 9 500 nM, 200 nM, 100 nM, 50 nM, 20 nM, 10 nM, 5 nM, 2 nM, 1 nM ImHpPy-y- 

20 PyPyPy-p-Dp, respectively; DNase I lane, DNase I digestion products in the absence of 
polyamide; A lane, adenine-specific chemical sequencing. Iverson, B. L. & Dervan, P. B. 
describes an adenine-specific DNA chemical sequencing reaction. Methods EnzymoL 15, 7823- 
7830 (1987). All reactions were done in a total volume of 40 ^iL. A polyamide stock solution 
or H 2 0 was added to an assay buffer containing radiolabeled restriction fragment, with the final 

25 solution conditions of 10 mM Tris-HCL 10 mM KC1, 10 mM MgCl 2 , 5 mM CaCl 2 , pH 7.0. 
Solutions were allowed to equilibrate for 4-12 h at 22 °C before initiation of footprinting 
reactions. Footprinting reactions, separation of cleavage products, and data analysis were 
carried out as described. White, S., Baird, E. E. & Dervan, P. describe the pairing rules for 
recognition in the minor groove of DNA by pyrrole-imidazole polyamides. Chemistry & 

30 Biology 4, 569-578 (1997). 

Figure 1 1 shows the synthesis of a Afunctional polyamide which incorporates the Hp/Py 
pair. Treatment of a sample of ImlmPyPy-y-ImHpPyPy-p-Pam-resin (see Figure 6) with 3,3'- 
diamino-Af-methyldipropylamine, 55°C, 18 h followed by reverse phase HPLC purification 
35 provides the Op polyamide with a free primary amine group which can be coupled to an 
activated carboxylic acid derivative. Treatment with (i) EDTA-dianhydride, DMSO/NMP, 
DIEA, 55 °C; (ii) 0.1M NaOH, followed by reverse phase HPLC purification provides the Op- 
Py-Im-polyamide-EDTA conjugate. Treatment of the 3-methyoxypyrrole polyamide with 
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thiophenol, NaH, DMF, at 100 °C for 120 min provides polyamide 2 after reverse phase HPLC 
purification. 

Figure 12 shows the determination of the binding orientation of hairpin polyamides 

5 ImlmPyPy.Y-ImHpPyPy-p-Dp-EDTA-Fe(II) 2-E*Fe(II) and ImlmHpPy-y-ImPyPyPy-p-Dp- 
EDTA*Fe(II) 3-E*Fe(II) by affinity cleaving footprint titration. Top and bottom left: Affinity 
cleavage experiments on a V 32 P labeled 250-bp pJK6 EcoKU Pvu II restriction fragment. The 
5'-TGGACA-3* and 5'-TGGTCA-3' sites are shown on the right side of the autoradiogram. 
Top left: lane 1, adenine-specific chemical sequencing reaction; lanes 2-6, 6.5 ^M, 1.0 nM, 

10 100 nM, 10 nM, 1 nM polyamide 2-E*Fe(II); lane 7, intact restriction fragment, no polyamide 
added. Bottom left: lane 1, A reaction; lanes 2-6, 8.5 ^M, 1.0 fiM, 100 nM, 10 nM, 1 nM 
polyamide 3-E*Fe(II); lane 7, intact DNA. All reactions were carried out in a total volume of 
40 nL. A stock solution of polyamide or H 2 0 was added to a solution containing 20 kcpm 
labeled restriction fragment, affording final solution conditions of 25 mM Tris-Acetate, 20 mM 

15 NaCl, 100 jiM/ bp calf thymus DNA, at pH 7.0. Solutions were allowed to equilibrate for a 
minimum of 4 h at 22°K before initiation of reactions. Affinity cleavage reactions were carried 
out as described White, S., Baird, E.E. & Dervan, P.B. Effects of the A*T/T»A degeneracy of 
pyrrole-imidazole polyamide recognition in the minor groove of DNA. Biochemistry 35, 6147- 
6152 (1996). Top and bottom right: Affinity cleavage patterns of 2-E»Fe(II) and 3-E*Fe(II) at 

20 100 nM bound to 5'-TGGACA-3' and 5'-TGGTCA-3\ Bar heights are proportional to the 
relative cleavage intensities at each base pair. Shaded and nonshaded circles denote imidazole 
and pyrrole carboxamides, respectively. Nonshaded diamonds represent the p-alanine moiety. 
A curved line represents the y-aminobutyric acid, and the + represents the positively charged 
dimethylaminopropylamide tail group. The boxed Fe denotes the EDTA-Fe(II) cleavage 

25 moiety. 

Figure 13 shows quantitative DNase I footprint titration experiments with the 
polyamides ImPyPyPyPy-y-ImPyPyPyPy-P-Dp and ImHpPyPyPy-y-ImHpPyPyPy-p-Dp on the 
3' 32 P labeled 252-bp pJK7 EcoRl/ Pvu II restriction fragment. For ImPyPyPyPy-y- 

30 ImPyPyPyPy-P-Dp gel (left): lane 1, DNase I digestion products in the absence of polyamide; 
lanes 2-18, DNase I digestion products in the presence of 1 .0 nM, 500 nM, 200, 100, 65, 40, 25, 
15, 10, 6.5, 4.0, 2.5, 1.5, 1.0, 0.5, 0.2, 0.1 nM polyamide; lane 19, DNase I digestion products in 
the absence of polyamide; lane 20, intact restriction fragment; lane 21, guanine-specific 
chemical sequencing reaction; lane 22, adenine-specific chemical sequencing reaction. For 

35 ImHpPyPyPy-Y-ImHpPyPyPy-p-Dp gel (right): lane 1, intact DNA; lane 2, DNase I digestion 
products in the absence of polyamide; lanes 3-19, 1.0 jiM, 500 nM, 200, 100, 50, 20, 10, 5, 2, I, 
0.5, 0.2. 0.1, 0.05, 0.01, 0.005, 0.001 nM polyamide; lane 20, DNase I digestion products in the 
absence of polyamide; lane 21, A reaction. AH reactions were done in a total volume of 400 
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^L. A polyamide stock solution or H 2 0 was added to an assay buffer containing radiolabeled 
restriction fragment, with the final solution conditions of 10 mM Tris-HCl, 10 mM KC1, 10 mM 
MgCl 2 , 5 mM CaCl 2 , pH 7.0. Solutions were allowed to equilibrate for 4-12 h at 22°C before 
initiation of footprinting reactions. Footprinting reactions, separation of cleavage products, and 
5 data analysis were carried as described. White, S., Baird, E.E. & Dervan, P.B. Effects of the 
A-T/T-A degeneracy of pyrrole-imidazole polyamide recognition in the minor groove of DNA. 
Biochemistry 35, 6147-6152 (1996). 

Fig. 14 shows the 8-ring Hp-Py-Im-polyamide hairpins described by the pairing code of 
10 the present invention. The eight ring hairpin template is shown at the top. A polyamide having 
the formula X 1 X 2 X3X4-y-X 5 X 6 X 7 X 8 wherein y is the -NH-CH 2 -CH r CH 2 -CONH- hairpin 
linkage derived from y-aminobutyric acid or a chiral hairpin linkage derived from R-2,4- 
diaminobutyric acid; X 4 /X 5 , X 3 /X 6 , X 2 /X 7 , and X,/X 8 represent carboxamide binding pairs 
which bind the DNA base pairs. The minor groove sequence to be bound is represented as 5'- 
15 WGTNNW-3', where the 5'-GTNN-3' core sequence is defined as position a, b, c, and d (W = 
A or T, N = A, G, C, or T). A linear sequence of aromatic amino acids fills the hairpin template 
in order to satisfy the ring pairing requirements to correspond to the DNA base pairs in the 
minor groove to be bound. The ring pairing code as applied is listed in Table 2. The 16 unique 
hairpin polyamides which target 16 S'-WGTNNW^' sequences are drawn as binding models 
20 where filled and unfilled circles represent imidazole and pyrrole rings respectively; circles 
containing an H represent 3-hydroxypyrrole, and the curved line connecting the polyamide 
subunits represents y-aminobutyric acid. 

Fig. 15 shows the 8-ring Hp-Py-Im-polyamide hairpins described by the pairing code of 
25 the present invention. The eight ring hairpin template is shown at the top. A polyamide having 
the formula X,X 2 X3X4-y-X 5 X 6 X7X8 wherein y is the -NH-CH 2 -CH 2 -CH 2 -CONH- hairpin 
linkage derived from y-aminobutyric acid or a chiral hairpin linkage derived from R-2,4- 
diaminobutyric acid; X 4 /X 5 , X 3 /X 6 , X 2 /X 7 , and Xj/Xg represent carboxamide binding pairs 
which bind the DNA base pairs. The minor groove sequence to be bound is represented as 5'- 
30 WGANNW-3\ where the 5'-GANN-3' core sequence is defined as position a, b, c, and d (W = 
A or T, N = A, G, C, or T). A linear sequence of aromatic amino acids fills the hairpin template 
in order to satisfy the ring pairing requirements to correspond to the DNA base pairs in the 
minor groove to be bound. The ring pairing code as applied is listed in Table 2. The 16 unique 
hairpin polyamides which target 16 5'-WGANNW-3' sequences are drawn as binding models 
35 where filled and unfilled circles represent imidazole and pyrrole rings respectively; circles 
containing an H represent 3-hydroxypyrrole, and the curved line connecting the polyamide 
subunits represents y-aminobutyric acid. 



13 



WO 98/37066 



PCT/US98/01006 



Fig. 16 shows the 8-ring Hp-Py-Im-polyamide hairpins described by the pairing code of 
the present invention. The eight ring hairpin template is shown at the top. A polyamide having 
the formula X^^X^y-XsX^Xg wherein y is the -NH-CH 2 -CH 2 -CH 2 -CONH- hairpin 
linkage derived from y-aminobutyric acid or a chiral hairpin linkage derived from R-2,4- 
diaminobutyric acid; X«/X 5 , X 3 /X 6 , X 2 /X 7 , and X,/X 8 represent carboxamide binding pairs 
which bind the DNA base pairs. The minor groove sequence to be bound is represented as 5'- 
WGGNNW-3\ where the 5'-GGNN-3' core sequence is defined as position a, b, c, and d (W = 
A or T, N = A, G, C, or T). A linear sequence of aromatic amino acids fills the hairpin template 
in order to satisfy the ring pairing requirements to correspond to the DNA base pairs in the 
minor groove to be bound. The ring pairing code as applied is listed in Table 2. The 16 unique 
hairpin polyamides which target 16 5*-WGGNNW-3' sequences are drawn as binding models 
where filled and unfilled circles represent imidazole and pyrrole rings respectively; circles 
containing an H represent 3-hydroxypyrrole, and the curved line connecting the polyamide 
subunits represents y-aminobutyric acid. 



Fig. 17 shows the 8-ring Hp-Py-Im-polyamide hairpins described by the pairing code of 
the present invention. The eight ring hairpin template is shown at the top. A polyamide having 
the formula X,X 2 X3X4-y-X 5 X 6 X 7 X 8 wherein y is the -NH-CH 2 -CH 2 -CH 2 -CONH- hairpin 
linkage derived from y-aminobutyric acid or a chiral hairpin linkage derived from R-2,4- 
diaminobutyric acid; X4/X 5 , X 3 /X 6 , X 2 /X 7 , and X,/X 8 represent carboxamide binding pairs 
which bind the DNA base pairs. The minor groove sequence to be bound is represented as 5'- 
WGCNNW-3', where the 5'-GCNN-3' core sequence is defined as position a, b, c, and d (W = 
A or T, N = A, G, C, or T). A linear sequence of aromatic amino acids fills the hairpin template 
in order to satisfy the ring pairing requirements to correspond to the DNA base pairs in the 
minor groove to be bound. The ring pairing code as applied is listed in Table 2. The 16 unique 
hairpin polyamides which target 16 5'-WGCNNW-3' sequences are drawn as binding models 
where filled and unfilled circles represent imidazole and pyrrole rings respectively; circles 
containing an H represent 3-hydroxypyrrole, and the curved line connecting the polyamide 
subunits represents y-aminobutyric acid. 



Four-ring polyamide subunits, covatently coupled to form eight-ring haiipin structures, 
bind specifically to 6-bp target sequences at subnanomolar concentrations. Trauger, J.W., 
Baird, E. E. & Dervan, P.B. describe the recognition of DNA by designed ligands at 
subnanomolar concentrations. Nature 382, 559-561 (1996); Swalley, S. E., Baird, E. E. & 
Dervan, P. B. describe the discrimination of 5'-GGGG-3\ 5'-GCGC-3\ and S'-GGCC^' 
sequences in the minor groove of DNA by eight-ring hairpin polyamides. J. Am. Chem. Soc. 
119, 6953-6961 (1997). The DNA-binding affinities of three eight-ring hairpin polyamides 
shown in Figure 1 as compound 1, 2, and 3 containing pairings of Im/Py, Py/Im opposite G»C, 
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OG and either Py/Py, Hp/Py, or Py/Hp at a common single point opposite T«A and A«T has 
been determined. Equilibrium dissociation constants (KJ) for ImlmPyPy-y-ImPyPyPy-p-Dp 1, 
ImImPyPy-y-ImHpPyPy-p-E)p 2, ImlmHpPy-y-ImPyPyPy-p-Dp 3 of Figure 1 are shown in 
Table 1. Brenowitz, M., Senear, D. F., Shea, M A. & Ackers, G. K. describe a quantitative 
DNase footprint titration method for studying protein-DNA interactions. Methods EnzymoL 
130, 132-181 (1986); The K<j values were determined by quantitative DNase I footprint 
titration experiments: on a 3' 32 P-labeled 250-bp DNA fragment containing the target sites, 5'- 
TGGACA-3' and S'-TGGICA-S' which differ by a single A*T base pair in the fourth position. 
The DNase footprint gels are shown in Figure 3. 
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K d = 0.48 nM 



tf d =37 nM 



*The reported dissociation constants are the average values obtained from three 
DNase I footprint titration experiments. The standard deviation for each data set is 
less than 15% of the reported number. Assays were carried out in the presence of 10 
mM Tris'HCl 10 mM KCl, 10 mM MgCl 2 , and 5 mM CaCi 2 at pH 7.0 and 22 °C 
tRing pairing opposite T»A and A # T in the fourth position. 
^Calculated as K d (5'-TGGACA-3')/K d (5 / -TGGTC A-3')- 



Based on the pairing rules for polyamide-DNA complexes both of these sequences are a 
match for control polyamide 1 which places a Py/Py pairing opposite 

A»T and T>A at both sites. It was determined that in polyamide 1 (Py/Py) binds to 5'- 
TGG1CA-3' and 5*-TGGACA-3' within a factor of 2 (K^ = 0.077 or 0.15 nM respectively). In 
contrast, polyamide 2 (Py/Hp) binds to 5 , -TGGICA-3 > and 5 , -TGGACA-3' with dissociation 
constants which differ by a factor of 18 (K^ = 15 nM and 0.83 nM respectively). By reversing 
the pairing in polyamide 3 (Hp/Py) the dissociation constants differ again in the opposite 
direction by a factor of 77 (K D = 0.48 nM and 37 nM respectively. Control experiments 
performed on separate DNA fragments; reveal that neither a S'-TGGGCA-S' or a 5'-TGGCCA- 
3' site is bound by polyamide 2 or 3 at concentrations < 100 nM, indicating that the Hp/Py and 
Py/Hp ring pairings do not bind opposite G»C or OG. The A»T vs. T*A discrimination is 
achieved preferably when the two neighboring base pairs are G«C and OG (GTC vs. GAC). 
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The specificity of polyamides 2 and 3 for sites which differ by a single A»T/T»A base 
pair results from small chemical changes. Replacing the Py/Py pair in 1 with a Py/Hp pairing 
as in 2, a single substitution of C3-OH for C3-H, destabilizes interaction with 5'-TGGTCA-3' 
by 191-fold, a free energy difference of 3.1 kcal mol" 1 . Interaction of 2 with 5'-TGGACA-3' is 
destabilized only 6-fold relative to 1, a free energy difference of 1.1 kcal mol 1 . Similarly, 
replacing the Py/Py pair in 1 with Hp/Py as in 3 destabilizes interaction with 5'-TGGACA-3' 
by 252-fold, a free energy difference of 3.2 kcal mof'. Interaction of 3 with 5'TGGICA-3' is 
destabilized only 6-fold relative to 1, a free energy difference of 1 .0 kcal mof 1 . 



The polyamides of this invention provide for coded targeting of predetermined DNA 
sequences with affinity and specificity comparable to sequence-specific DNA binding proteins. 
Hp, Im, and Py polyamides complete the minor groove recognition code using three aromatic 
amino acids which combine to form four ring pairings (Im/Py, Py/Im, Hp/Py, and Py/Hp) which 
complement the four Watson-Crick base pairs, as shown in TABLE 2. There are a possible 240 
15 four base pair sequences which contain at least 1 A»T or T»A base pair and therefore can 
advantageously use an Hp/Py, or Py/Hp carboxamide binding. Polyamides binding to any of 
these sequences can be designed in accordance with the code of TABLE 2. 



TABLE 2 Pairing code for minor groove recognition* 
Pair G'C C»G T«A A*T 

Im/Py + 

Py/Im + 

Hp/Py + 

Py/Hp + 



* favored (+), disfavored (-) 
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For certain G«C rich sequences the affinity of polyamide^DNA complexes may be 
enhanced by substitution of an Im/p pair for Im/Py at G*C and p/Im for Py/Im at OG. At A*T 
and T-A base pairs, either a Py/p, p/Py, and p/p may be used. The alternate aliphatic/aromatic 
amino acid pairing code is described in Table 3. 



TABLE 3 Aliphatic/ Aromatic substitution for ring 


pairings* 




Pair 


Substitution 


Im/Py 


Im/p 


Py/Im 


p/lm 


Hp/Py 


Py/p,p/Py,Hp/p t p/p 


Py/Hp 


Py/p, p/Py, p/Hp,p/p 



U. S. Patent 5,578,444 describes numerous promoter region targeting sequences from 
which base pair sequences for targeting a polyamide can be identified. 

PCT U.S. 97/003332 describes methods for synthesis of polyamides which are suitable 
for preparing polyamides of this invention. The use of p-alanine in place of a pyrrole amino 
acid in the synthetic methods provides aromatic/aliphatic pairing (Im/p, p/Im, Py/p, and p/Py) 
and aliphatic/aliphatic pairing (p/p) substitution. The use of y-aminobutyric acid, or a 
substituted y-aminobutyric acid such as (R)-2,4 diaminobutyric acid, provides for preferred 
hairpin turns. The following examples illustrate the synthesis of polyamides of the present 
invention. 
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Example 1: 

PREPARATION OF A PROTECTED Hp MONOMER FOR SOLID PHASE 

SYNTHESIS. 

Distamycin and its analogs have previously been considered targets of traditional 
multistep synthetic chemistry. Arcamone, F., Orezzi, P. G., Barbieri, W., Nicolella, V. & Penco, 
S. describe a solution phase synthesis of distamycin Gazz. Chim. Ital. 1967, 97, 1097. The 
repeating amide of distamycin is formed from an aromatic carboxylic acid and an aromatic 
amine. The aromatic acid is often unstable to decarboxylation, and the aromatic amines have 
been found to be air and light sensitive. Lown, J. W. & Krowicki, K. describe a solution phase 
synthesis of Distamycin J. Org. Chem. 1985, SO, 3774. The variable coupling yields, long 
reaction times (often >24 h), numerous side products, and reactive intermediates (acid chlorides 
and trichloro ketones) characteristic of the traditional solution phase coupling reactions make 
the synthesis of the aromatic carboxamides problematic. B. Merrifield describes the solid phase 
synthesis of a tetrapeptide J. Am. Chem. Soc. 1963, 85, 2149. In order to implement an efficient 
solid phase methodology for the synthesis of the pyrrole- imidazole polyamides, the following 
components were developed: (1) a synthesis which provides large quantities of appropriately 
protected monomer or dimer building blocks in high purity, (2) optimized protocols for forming 
an amide in high yield from a support-bound aromatic amine and an aromatic carboxylic acid, 
(3) methods for monitoring reactions on the solid support, and (4) a stable resin linkage agent 
20 that can be cleaved in high yield upon completion of the synthesis. Baird, E. E. & Dervan, P. B. 
describes the solid phase synthesis of polyamides containing imidazole and pyrrole amino 
acids. J. Am. Chem. Soc. 118, 6141-6146 (1996); also see PCT US 97/003332. In order to 
prepare polyamides which contain the 3-hydroxypyrrole monomer, a synthesis has been 
developed which allows the appropriately protected Boc-Op acid monomer to be prepared on 50 
25 g scale. 1H NMR and ,3 C NMR spectra were recorded on a General Electric-QE 300 NMR 
spectrometer in CD 3 OD or DMSO-</ 6 , with chemical shifts reported in parts per million relative 
to residual CHD 2 OD or DMSO-d 5 , respectively. IR spectra were recorded on a Perkin-Elmer 
FTIR spectrometer. High-resolution mass spectra were recorded using fast atom bombardment 
(FABMS) techniques at the Mass Spectrometry Laboratory at the University of California, 
30 Riverside. Reactions were executed under an inert argon atmosphere. Reagent grade chemicals 
were used as received unless otherwise noted. Still, W. C, Kahn, M. & Mitra, A. describe flash 
column chromatography J. Org. Chem. 1978, 40, 2923-2925. Flash chromatography was 
carried out using EM science Kieselgel 60 (230-400) mesh. Thin-layer chromatography was 
performed on EM Reagents silica gel plates (0.5 mm thickness). All compounds were 
35 visualized with short-wave ultraviolet light. 



18 



WO 98/37066 



PCT/US98/01006 



Table 4 rlntermediates for preparation of Boc-protected 3-methoxypyrrole 


NAME 


STRUCTURE 


Ethyl 4-carboxy-3-hydfoxy-l- 
methyIpyrrole-2-carboxylate. 


O 

Jl OH 

' 0 


Ethyl 4-[(Benzyloxycarbonyl)amino]-3- 
hydroxy-l-methylpyrrole-2-carboxylate 


o tl OH 


Ethyl 4-[(Benzyloxycarbonyl)aminol-3- 
methoxy-1 -methyl pyrrole- 2-carboxy late 


O N OMe 


Ethyl 4-[(tert-Butyloxycarbonyl)amino]-3- 
methoxy-l-methylpyrrole-2-carboxylate 


1 o 


4-[(tert-Buryloxycarbonyl)amino]-3-methoxy 
-l-methylpyrrole-2-carboxyIic acid 


1 o 



£f/*y/ 4-[(benzyloxycarbonyl)am\no]-$-hydroxy-l-methy\py Ethyl-4- 
carboxy-3-hydroxy-l-methylpyrrole-2-carboxylate (60 g, 281.7 mmol) was dissolved in 282 
mL acetonitrile. TEA (28.53 g, 282 mmol) was added, followed by diphenylphosphorylazide 
(77.61 g, 282 mmol). The mixture was refluxed for 5 hours, followed by addition of benzyl 
alcohol (270 ml) and reflux continued overnight. The solution was cooled and volitiles 
removed in vacuo. The residue was absorbed onto silca and chromatagraphed, 4:1 hexanes : 
ethyl acetate, to give a white solid (21.58 g, 24%) l E NMR (DMSO-d6) 5 8.73 (s, 1H), 8.31 (s, 
1H), 7.31 (m, 5H), 6.96 (s, 1H), 5.08 (s, 2H), 4.21 (q, 2H, J = 7.1 Hz), 3.66 (s, 3H), 1.25 (t, 3H, 
J = 7.1 Hz); MS m/e 319.163 (M+H 319.122 calcd. for C16H18N2O5). 

Ethyl 4-[(tert-butoxycarbonyl)amino]-3-methoxy-l-metfc^ carboxylate. Ethyl 

4-[(benzyloxycarbonyl)amino]0-hydroxy-l-methylpyrrole-2-carboxylate (13.4 g, 42.3 mmol) 
was dissolved in 110 mL acetone. Anhydrous K2CO3 (11.67 g, 84.5 mmol) was added, 
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followed by methyliodide (5.96 g, 42.3 mmol) and dimethylaminopyridine (0.5 g, 4.23 mmol) 
and the mixture stirred overnight. The solid K2CO3 was removed by filtration and 200 ml 
water added. Volitiles were removed in vacuo and the solution made acidic with addition of IN 
H2SO4 . The aqueous layer was extracted with diethyl ether. Organic layers were combined, 

5 washed with 10% H2SO4, dried over MgS04, and dried to give a white solid. The solid was 
used without further purification and dissolved in 38 ml DMR DIEA (1 1 ml), Boc anhydride 
(9.23 g, 42.3 mmol), and 10 % Pd/C (500 mg) were added and the solution stirred under 
hydrogen (1 atm) for 2.1 h. The slurry was filtered through celite which was washed with 
methanol. Water 250 ml was added and volitiles removed in vacuo. The aqueous layer was 

10 extracted with ether. Organic layers were combined, washed with water and brine, and dried 
over MgS04. Solvent was removed in vacuo to give a white solid ( 8.94 g, 71%) *H NMR 
(DMSO-d 6 ) 5 8.43 (s, 1H), 7.03 (s, 1H), 4.19 (q, 2H, J = 7.1 Hz), 3.70 (s, 3H), 3.67 (s, 3H), 
1.42 (s, 9H), 1.26 (t, 3H, J = 7.1); MS m/e 299.161 (M+H 299.153 calcd. for C14H22N2O5). 

15 Ethyl 4-[(benzyloxycarbonyl)amino]-3-hydroxy-l-methylpy Ethyl 4- 

[(/err-butoxycarbonyl)amino]-3-methoxy-l-methylpyrrole-2-carboxylate (9.0 g, 30.2 mmol) 
was dissolved in 30 mL ethanol. NaOH (30 ml, 1 M, aq) was added and the solution stirred for 
4 days. Water (200 ml) was added and ethanol removed in vacuo. The solution was extracted 
with diethyl ether, aqueous layer acidified to pH = 2-3, and extracted again with diethyl ether. 

20 Organic layers were dried over MgS04, and solvent removed in vacuo to give a white solid (6.0 
g, 20.5 mmol, 87% based on recovered SM) ! H NMR (DMSO-d6) 5 12.14 (s, 1H), 8.37 (s, 
1H), 6.98 (s, 1H), 3.69 (s, 3H), 3.66 (s, 3H), 1.42 (s, 9H); MS m/e 293.112 (M+H 293.104 
calcd. forCl2Hi8N205). 

25 EXAMPLE 2: 

SOLID PHASE SYNTHESIS OF 3-HYDROXYPYRROLE POLYAMIDES. 

Cycling protocols were optimized to afford high stepwise coupling yields (>99%). 
Deprotection by aminolysis affords up to 100 mg quantities of polyamide. Solid phase 

30 polyamide synthesis protocols were modified from the in situ neutralization Boc-chemistry 
protocols. Schnolzer, M., Alewood, P., Jones, A., Alewood, D., Kent, S.B.H. report rapid in situ 
neutralization for solid phase peptide synthesis Int. J. Peptide. Protein. Res. 1992, 40, 180. 
Coupling cycles are rapid, 72 min per residue for manual synthesis or 1 80 min per residue for 
machine-assisted synthesis, and require no special precautions beyond those used for ordinary 

35 solid phase peptide synthesis. Manual solid phase synthesis of a pyrrole-imidazole polyamide 
consists of a dichloromethane (DCM) wash, removal of the Boc group with trifluoroacetic acid 
(TFA)/DCM/thiophenol (PhSH), a DCM wash, a DMF wash, taking a resin sample for analysis, 
addition of activated monomer, addition of DIEA if necessary, coupling for 45 min, taking a 
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resin sample for analysis, and a final DMF wash (Figure 5, Table I). In addition, the manual 
solid phase protocol for synthesis of pyrrole-imidazole polyamides has been adapted for use on 
a ABI 430A peptide synthesizer. The aromatic amine of the pyrrole and imidazole do not react 
in the quantitative ninhydrin test. Stepwise cleavage of a sample of resin and analysis by HPLC 
indicates that high stepwise yields (> 99%) are routinely achieved. 

Dicyclohexylcarbodiimide (DCC), Hydroxybenzotriazole (HOBt), 2-(lH-Benzotriazole- 
l-yl)-l,l,3,3-tetramethyluronium hexa-fluorophosphate (HBTU) and 0.2 mmol/gram Boc-f> 
alanine-(-4-carboxamidomethyl)-benzyl-ester-copoly(styrene-divinylbenzene) resin (Boc-p- 
Pam-Resin) was purchased from Peptides International (0.2 mmol/gram), NovaBiochem (0.6 
mmol/gram), or Peninsula (0.6 mmol/gram). ( (/c)-2-Fmoc-4-Boc-diaminobutyric acid, (5)-2- 
Fmoc-4-Boc-diaminobutyric acid, and (/c)-2-amino-4-Boc-diaminobutyric acid were purchased 
from Bachem. MW-diisopropylethylamine (DIEA), jV,W-dimethylformamide (DMF), N- 
methylpyrrolidone (NMP), DMSO/NMP, Acetic anhydride (Ac 2 0), and 0.0002 M potassium 
cyanide/pyridine were purchased from Applied Biosystems. Dichloromethane (DCM) and 
triethylamine (TEA) were reagent grade from EM, thiophenol (PhSH), 
dimethylaminopropylamine (Dp), Sodium Hydride, (*)-a-methoxy-o> 
(trifuoromethyOphenylacetic acid ((/?)MPTA) and (5)-a-methoxy-a- 
(trifouromethyl)phenylacetic acid ((S)MPTA) were from Aldrich, trifluoroacetic acid (TFA) 
Biograde from Halocarbon, phenol from Fisher, and ninhydrin from Pierce. All reagents were 
used without further purification. 

Quik-Sep polypropylene disposable filters were purchased from Isolab Inc. 'H NMR 
spectra were recorded on a General Electric-QE NMR spectrometer at 300 MHz with chemical 
shifts reported in parts per million relative to residual solvent. UV spectra were measured in 
water on a Hewlett-Packard Model 8452A diode array spectrophotometer. Optical rotations 
were recorded on a JASCO Dip 1000 Digital Polarimeter. Matrix-assisted, laser 
desorption/ionization time of flight mass spectrometry (MALDI-TOF) was performed at the 
Protein and Peptide Microanalytical Facility at the California Institute of Technology. HPLC 
analysis was performed on either a HP 1090M analytical HPLC or a Beckman Gold system 
using a RAINEN C, 8 , Microsorb MV, 5um, 300 x 4.6 mm reversed phase column in 0.1% 
(wt/v) TFA with acetonitrile as eluent and a flow rate of 1.0 mL/min, gradient elution 1.25% 
acetonitrile/min. Preparatory reverse phase HPLC was performed on a Beckman HPLC with a 
Waters DeltaPak 25 x 100 mm, 100 um C18 column equipped with a guard, 0.1% (wt/v) TFA, 
0.25% acetonitrile/min. 18MQ water was obtained from a Millipore MilliQ water purification 
system, and all buffers were 0.2 um filtered. 
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Activation of Boc-3-methoxypyrrole acid. The amino acid (0.5 mmol) was dissolved in 2 mL 
DMF. HBTU (190 mg, 0.5 mmol) was added followed by DIEA (1 mL) and the resulting 
mixture was shaken for 5 min. 

5 Activation of Imidazole-2-carboxylic acid, y-aminobutyric acid, Boc-giycine, and Boc-P- 
alanine. The appropriate amino acid or acid (2 mmol) was dissolved in 2 mL DMF. HBTU 
(720 mg, 1.9 mmol) was added followed by DIEA (1 mL) and the solution shaken for at least 5 
min. 

10 Activation of Boc-Imidazole acid. Boc imidazole acid (257 mg, 1 mmol) and HOBt (135 mg, 
1 mmol) were dissolved in 2 mL DMF, DCC (202 mg, 1 mmol) is then added and the solution 
allowed to stand for at least 5 min. 

Acetylation Mix. 2 mL DMF, DIEA (710 jiL, 4.0 mmol), and acetic anhydride (380 jiL, 4.0 
15 mmol) were combined immediately before use. 

Manual Synthesis Protocol. Boc-B-alanine-Pam-Resin (1.25 g, 0.25 mmol) is placed in a 20 
mL glass reaction vessel, shaken in DMF for 5 min and the reaction vessel drained. The resin 
was washed with DCM (2 x 30 s.) and the Boc group removed with 80% TFA/DCM/0.5M 

20 PhSH, 1 x 30s., 1 x 20 min The resin was washed with DCM (2 x 30 s.) followed by DMF (1 x 
30 s.) A resin sample (5-10 mg) was taken for analysis. The vessel was drained completely and 
activated monomer added, followed by DIEA if necessary. The reaction vessel was shaken 
vigorously to make a slurry. The coupling was allowed to proceed for 90 min, and a resin 
sample taken. Acetic anhydride (1 mL) was added and the reaction shaken for 5 min. The 

25 reaction vessel was then washed with DMF, followed by DCM. 

Machine-Assisted Protocols. Machine-assisted synthesis was performed on a ABI 430A 
synthesizer on a 0.18 mmol scale (900 mg resin; 0.2 mmol/gram). Each cycle of amino acid 
addition involved: deprotection with approximately 80% TFA/DCM/0.4M PhSH for 3 minutes, 

30 draining the reaction vessel, and then deprotection for 17 minutes; 2 dichloromethane flow 
washes; an NMP flow wash; draining the reaction vessel; coupling for 1 hour with in situ 
neutralization, addition of dimethyl sulfoxide (DMSO)/NMP, coupling for 30 minutes, addition 
of DIEA, coupling for 30 minutes; draining the reaction vessel; washing with DCM, taking a 
resin sample for evaluation of the progress of the synthesis by HPLC analysis; capping with 

35 acetic anhydride/DIEA in DCM for 6 minutes; and washing with DCM. A double couple cycle 
is employed when coupling aliphatic amino acids to imidazole, all other couplings are 
performed with single couple cycles. 
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The ABI 430A synthesizer was left in the standard hardware configuration for NMP- 
HOBt protocols. Reagent positions 1 and 7 were DIEA, reagent position 2 was TFA/0.5M 
thiophenol, reagent position 3 was 70% ethanolamine/methanol, reagent position 4 was acetic 
anhydride, reagent position 5 was DMSO/NMP, reagent position 6 was methanol, and reagent 
5 position 8 was DMFr New activator functions were written, one for direct transfer of the 
cartridge contents to the concentrator (switch list 21, 25, 26, 35, 37, 44), and a second for 
transfer of reagent position 8 directly to the cartridge (switch list 37, 39, 45, 46). 

Boc-Py-OBt ester (357 rng, 1 mmol) was dissolved in 2 ml DMF and filtered into a 
10 synthesis cartridge. Boc-Im acid monomer was activated (DCC/HOBt), filtered, and placed in a 
synthesis cartridge. Imidazole-2-carboxylic acid was added manually. At the initiation of the 
coupling cycle the synthesis was interrupted, the reaction vessel vented and the activated 
monomer added directly to the reaction vessel through the resin sampling loop via syringe. 
When manual addition was necessary an empty synthesis cartridge was used. Aliphatic amino 
15 acids (2 mmol) and HBTU (1.9 mmol) were placed in a synthesis cartridge. 3 ml of DMF was 
added using a calibrated delivery loop from reagent bottle 8, followed by calibrated delivery of 
1 ml DIEA from reagent bottle 7, and a 3 minute mixing of the cartridge. 

The activator cycle was written to transfer activated monomer directly from the cartridge to 
20 the concentrator vessel, bypassing the activator vessel. After transfer, 1 ml of DIEA was 
measured into the cartridge using a calibrated delivery loop, and the DIEA solution combined 
with the activated monomer solution in the concentrator vessel. The activated ester in 2:1 
DMF/DIEA was then transferred to the reaction vessel. All lines were emptied with argon 
before and after solution transfers. 

25 

ImImOpPy-y-ImPyPyPy-$-Dp. ImlmOpPy-y-ImPyPyPy-p-Parn-Resin was synthesized 
in a stepwise fashion by machine-assisted solid phase methods from Boc-P-Pam-Resin (0.66 
mmol/g). Baird, E. E. & Dervan, P. B. describes the solid phase synthesis of polyamides 
containing imidazole and pyrrole amino acids. J. Am. Chem. Soc. 118, 6141-6146 (1996); also 

30 see PCT US 97/003332. 3-hydroxypyrrole-Boc-amino acid (0.7 mmol) was incorporated by 
placing the amino acid (0.5 mmol) and HBTU (0.5 mmol) in a machine synthesis cartridge. 
Upon automated delivery of DMF (2 mL) and DIEA (1 mL) activation occurs. A sample of 
ImlmOpPy-Y-ImPyPyPy-p-Pam-Resin (400 mg, 0.40 mmol/gram) was placed in a glass 20 mL 
peptide synthesis vessel and treated with neat dimethylaminopropylamine (2 mL) and heated 

35 (55 °C) with periodic agitation for 16 h. The reaction mixture was then filtered to remove resin, 
0.1% (wt/v) TFA added (6 mL) and the resulting solution purified by reversed phase HPLC. 
ImlmOpPy-y-ImPyPyPy-p-Dp is recovered upon lyophilization of the appropriate fractions as a 
white powder (97 mg, 49% recovery). UV (H 2 0) 246, 316 (66,000); ! H NMR (DMSO-tf*) 
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S 10.24 (s, 1 H), 10.14 (s, 1 H), 9.99 (s, 1 H), 9.94 (s, 1 H), 9.88 (s, 1 H), 9.4 (br s, 1 H), 9.25 (s, 
1 H), 9.1 1 (s, 1 H), 8.05 (m, 3 H), 7.60 (s, 1 H), 7.46 (s, 1 H), 7.41 (s, 1 H), 7.23 (d, 1), 7.21 (d, 
1 H), 7.19 (d, 1 H), 7.13 (m, 2 H), 7.11 (m, 2 H), 7.02 (d, 1 H), 6.83 (m, 2 H), 3.96 (s, 6 H), 
3.90 (s, 3 H), 3.81 (m, 6 H), 3.79 (s, 3 H), 3.75 (d, 9 H), 3.33 (q, 2 H, J= 5.4 Hz), 3.15 (q, 2 H, 
5 J- 5.5 Hz), 3.08 (q, 2 H, J = 6.0 Hz), 2.96 (quintet, 2 H, J = 5.6 Hz), 2.70 (d, 6 H, J- 4.5 Hz), 
2.32 (m, 4 H), 1.71 (m, 4 H); MALDI-TOF-MS (monoisotopic), 1253.5 (1253.6 calc. for 

C58H72N220,,). 

ImlmHpPy-y-ImPyPyPy. In order to remove the methoxy protecting group, a sample of 

10 ImlmOpPy-y-ImPyPyPy-P-Dp (5 mg, 3.9 ^imol) was treated with sodium thiophenoxide at 100 
°C for 2 h. DMF (1000 \iL) and thiophenol (500 \xh) were placed in a (13 x 100 mm) disposable 
Pyrex screw cap culture tube. A 60 % dispersion of sodium hydride in mineral oil (100 mg) was 
slowly added. Upon completion of the addition of the sodium hydride, ImlmOpPy-y-ImPyPyPy- 
p-Dp (5 mg) dissolved in DMF (500 ^iL) was added. The solution was agitated, and placed in a 

15 100 °C heat block, and deprotected for 2 h. Upon completion of the reaction the culture tube 
was cooled to 0°C, and 7 ml of a 20 % (wt/v) solution of trifluoroacetic acid added. The 
aqueous layer is separated from the resulting biphasic solution and purified by reversed phase 
HPLC. ImlmHpPy-y-ImPyPyPy-P-Dp is recovered as a white powder upon lyophilization of 
the appropriate fractions (3.8 mg, 77 % recovery). UV (H 2 0) K** 246, 312 (66,000); *H NMR 

20 (DMSO-rf 6 ) 8 10.34 (s, 1 H), 10.24 (s, 1 H), 10.00 (s, 2 H), 9.93 (s, 1 H), 9.87 (s, 1 H), 9.83 (s, 
1 H), 9.4 (br s, 1 H), 9.04 (s, 1 H), 8.03 (m, 3 H), 7.58 (s, 1 H), 7.44 (s, 1 H), 7.42 (s, 1 H), 7.23 
(s, 1 H), 7.20 (m, 3 H), 7.12 (m, 2 H), 7.05 (M H), 7.02 (d, 1 H), 6.83 (s, 1 H), 6.79 (s, 1 H), 
3.96 (s, 6 H), 3.90 (s, 3 H), 3.81 (s, 6 H), 3.79 (s, 3 H), 3.75 (d, 6 H), 3.33 (q, 2 H, J= 5.4 Hz), 
3.14 (q, 2 H, J = 5.4 Hz), 3.08 (q, 2 H, J= 6.1 Hz), 2.99 (quintet, 2 H, J = 5.4 Hz), 2.69 (d, 6 H, 

25 J = 4.2 Hz), 2.31 (m, 4 H), 1.72 (m, 4 H); MALDI-TOF-MS (monoisotopic), 1239.6 (1239.6 
calc. for C 5 7H7,N 2 20n). 

ImlmPyPy-y-ImOpPyPy-fi-Dp. ImlmPyPy-y-ImOpPyPy-p-Pam-Resin was synthesized 
in a stepwise fashion by machine-assisted solid phase methods from Boc-P-Pam-Resin (0.66 

30 mmol/g) as described for ImlmOpPy-y-ImPyPyPy-p-Dp. A sample of ImlmPyPy-y-ImOpPyPy- 
P-Pam-Resin (400 mg, 0.40 mmol/gram) was placed in a glass 20 mL peptide synthesis vessel 
and treated with neat dimethylaminopropylamine (2 mL) and heated (55 °C) with periodic 
agitation for 16 h. The reaction mixture was then filtered to remove resin, 0.1% (wt/v) TFA 
added (6 mL) and the resulting solution purified by reversed phase HPLC. ImlmPyPy-y- 

35 ImOpPyPy-P-Dp is recovered upon lyophilization of the appropriate fractions as a white 
powder (101 mg, 50% recovery). UV (H 2 0) 246, 316 (66,000); MALDI-TOF-MS 
(monoisotopic), 1253.6(1253.6 calc. for CsgHyj^On). 



24 



WO 98/37066 



PCT/US98/01006 



ImlmPyPy-y-ImHpPyPy. A sample of ImlmPyPy-y-ImOpPyPy-p-Dp (5 mg, 3.9 ^mol) 
was treated with sodium thiophenoxide and purified by reversed phase HPLC as described for 
ImlmHpPy-y-ImPyPyPy-p-Dp. ImlmPyPy-y-ImHpPyPy-P-Dp is recovered upon lyophilization 
of the appropriate fractions as a white powder (3.2 mg, 66 % recovery). UV (H 2 0) 246, 
5 312 (66,000); MALDI-TOF-MS (monoisotopic), 1239.6 (1239.6 catc. for C 57 H 7 ,N 22 0„). 

ImPyPy-y-OpPyPy-$-Dp. ImPyPy-y-OpPyPy-p-Pam-Resin was synthesized in a 
stepwise fashion by machine-assisted solid phase methods from Boc-P-Pam-Resin (0.66 
mmol/g). Baird, E. E. & Dervan, P. B. describes the solid phase synthesis of polyamides 
containing imidazole and pyrrole amino acids. 1 Am. Chem. Soc. 118, 6141-6146 (1996); also 
see PCT US 97/003332. 3-hydroxypyrrole-Boc-amino acid (0.7 mmol) was incorporated by 
placing the amino acid (0.5 mmol) and HBTU (0.5 mmol) in a machine synthesis cartridge. 
Upon automated delivery of DMF (2 mL) and DIEA (1 mL) activation occurs. A sample of 
ImPyPy-y-OpPyPy-P-Pam-Resin (400 mg, 0.45 mmol/gram) was placed in a glass 20 mL 
peptide synthesis vessel and treated with neat dimethyiaminopropylamine (2 mL) and heated 
(55 °C) with periodic agitation for 16 h. The reaction mixture was then filtered to remove resin, 
0.1% (wt/v) TFA added (6 mL) and the resulting solution purified by reversed phase HPLC. 
ImPyPy-y-OpPyPy-P-Dp is recovered upon lyophilization of the appropriate fractions as a 
white powder (45 mg, 25% recovery). UV (H 2 0) 7^ x 246, 310 (50,000); ! H NMR (DMSO-rf 5 ) 
5 10.45 (s, 1 H), 9.90 (s, 1 H), 9.82 (s, 1 H), 9.5 (br s, 1 H), 9.38 (s, 1 H), 9.04 (s, 1 H), 8.02 (m, 
3 H), 7.37 (s, I H), 7.25 (m, 2 H), 7.15 (d, 1 H, J= 1.6 Hz), 7.1 1 (m, 2 H), 7.09 (d, 1 H), 7.03 
(d, 1 H), 6.99 (d, 1 H), 6.87 (d, 1 H), 6.84 (d, 1 H), 3.96 (s, 3 H), 3.81 (s, 6 H), 3.77 (s, 6 H), 
3.76 (s, 3 H), 3.74 (s, 1 H), 3.34 (q, 2 H, J= 5.6 Hz), 3.20 (q, 2 H, J = 5.8 Hz), 3.09 (q, 2 H, J = 
6. 1 Hz), 2.97 (quintet, 2 H, J = 5.3 Hz), 2.70 (d, 6 H, J = 3.9 Hz), 2.34 (m, 4 H), 1 .73 (m, 4 H); 
MALDI-TOF-MS (monoisotopic), 1007.6 (1007.5 calc. for C 48 H 6 ,N 16 0 9 ). 

ImPyPy-y-HpPyPy. In order to remove the methoxy protecting group, a sample of 
ImPyPy-y-OpPyPy-p-Dp (5 mg, 4.8 nmol) was treated with sodium thiophenoxide at 100 °C 
for 2 h. DMF (1000 nL) and thiophenol (500 ^iL) were placed in a (13 x 100 mm) disposable 
30 Pyrex screw cap culture tube. A 60 % dispersion of sodium hydride in mineral oil (100 mg) was 
slowly added. Upon completion of the addition of the sodium hydride, ImlmPyPy-y-ImOpPyPy- 
P-Dp (5 mg) dissolved in DMF (500 jiL) was added. The solution was agitated, and placed in a 
100 °C heat block, and deprotected for 2 h. Upon completion of the reaction the culture tube 
was cooled to 0°C, and 7 ml of a 20 % (wt/v) solution of trifluoroacetic acid added. The 
35 aqueous layer is separated from the resulting biphasic solution and purified by reversed phase 
HPLC. ImlmHpPy-y-ImHpPyPy-p-Dp is recovered as a white powder upon lyophilization of 
the appropriate fractions (2.5 mg, 52 % recovery). UV (H 2 0) U 246, 310 (50,000); l H NMR 
(DMSO-^) 5 10.44 (s, 1 H), 10.16 (s, 1 H), 9.90 (s, 1 H), 9.77 (s, 1 H), 9.5 (br s, 1 H), 9.00 (s, 
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1 H), 8.03 (m, 3 H), 7.37 (s, 1 H), 7.26 (m, 2 H), 7.14 (d, 1 H, J= 1.7 Hz), 7.12 (m, 2 H), 7.02 
(d, 1 H), 6.93 (d, 1 H), 6.88 (d, 1 H), 6.82 (d, 1 H), 6.72 (d, 1 H), 3.96 (s, 3 H), 3.81 (s, 6 H), 
3.77 (s, 3 H), 3.76 (s, 3 H), 3.74 (s, 1 H), 3.36 (q, 2 H, J = 5.4 Hz), 3.22 (q, 2H,J= 5.9 Hz), 
3.09 (q, 2 H, J = 5.5 Hz), 2.98 (quintet, 2 H, J = 5.3 Hz), 2.70 (d, 6 H, J = 4.3 Hz), 2.34 (m, 4 
5 H), 1 .78 (m, 4 H); MALDI-TOF-MS (monoisotopic), 994.2 (993.5 calc. for C 47 H 61 N, 6 0 9 ). 



Table 5. Mass spectral characterization of Op and Hp polyamides, synthesized and purified as 

described for ImlmOpPy-y-ImPyPyPy-P-Dp and ImlmHpPy-y-ImPyPyPy-P-Dp. 

POLYAMIDE FORMULA (M+H)CALCD FOUND 



10 


ImOpPy-y-PyPyPy-p-Dp 


C 4 8H 63 N l6 0 9 


1007.5 


1007.5 




ImHpPy-Y-PyPyPy-P-Dp 


C 47 H 61 N 16 0 9 


993.5 


993.2 




ImPyOp-y-PyPyPy-P-Dp 


C 48 H 63 N 16 0 9 


1007.5 


1007.5 




ImPyHp-Y-PyPyPy-p-Dp 


C 47 H 61 N l6 0 9 


993.5 


993.4 




ImPyPy-Y-OpPyPy-p-Dp 


C 48 H 63 N 16 0 9 


1007.5 


1007.6 


15 


ImPyPy-Y-HpPyPy-P-Dp 


C 47 H 6l N I6 0 9 


993.5 


993.2 




ImPyPy-Y-PyOpPy-P-Dp 


C 48 H 63 N l6 0 9 


1007.5 


1007.5 




ImPyPy-Y-PyHpPy-p-Dp 


C 47 H 61 N 16 0 9 


993.5 


993.4 




ImOpOp-Y-PyPyPy-P-Dp 


v^^oiri a<N i aU i a 


1037.5 


1037.5 




ImHpHp-Y-PyPyPy-p-Dp 


C 47 H 6 |N| 6 0|o 


1009.5 


1009.4 


20 


ImlmOpPy-Y-ImPyPyPy-p-Dp 


C 5 8H72N 22 0„ 


1253.6 


1253.5 




ImlmHpPy-Y-ImPyPyPy-P-Dp 


C57H 71 N 22 0„ 


1239.6 


1239.6 




ImlmPyPy-Y-ImOpPyPy-P-Dp 


C S8 H 72 N 22 O n 


1253.6 


1253.6 




ImlmPyPy-Y-ImHpPyPy-p-Dp 


C 5 7H 7I N 22 0 M 


1239.6 


1239.6 




ImOpPyPy-Y-ImOpPyPy-p-Dp 


C 60 H 76 N 21 O 12 


1282.6 


1282.6 


25 


ImHpPyPy-Y-IniHpPyPy-P-Dp 


C58H 72 N 2 iOi 2 


1254.6 


1254.6 




ImlmOpPy-Y-ImOpPyPy-p-Dp 




1283.6 


1283.6 




ImlmHpPy-Y-IniHpPyPy-p-Dp 


C 57 H 7 ,N 22 0 12 


1255.6 


1255.5 




ImOpPyPy-Y-PyPyPyPy-P-Dp 


C6oH 7 sN 20 0|| 


1251.6 


1251.5 




ImPyPyPy-Y-PyPyOpPy-p-Dp 


C6oH 7 sN 2 oOn 


1251.6 


1251.5 


30 


ImlmPyPy-Y-IniPyOpPy-P-Dp 


C58H 72 N 22 On 


1253.6 


1253.7 




ImOpPyPyPy-Y-ImOpPyPyPy-p-Dp 


C 72 H 88 N 25 0|4 


1526.7 


1526.6 




ImHpPyPyPy-Y-ImHpPyPyPy-p-Dp 


C 70 H 8 4N 2 5O 14 


1498.7 


1498.0 




ImlmPyPyPy-Y-ImOpOpPyPy-P-Dp 


C 71 H 87 N 26 0,4 


1527.7 


1527.7 
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EXAMPLE 3: 
DETERMINATION OF POLY AMIDE 
BINDING AFFINITY AND SEQUENCE SPECIFICITY. 

5 Representative-footprint titration experiments are shown in Figures 3 and 10. A 252-bp 

DNA fragment which is typically used for the footprint titration experiments provides 247 
possible 6-bp binding sites for an eight-ring hairpin polyamide. Thus, in addition to providing 
DNA binding affinities, the footprint titration experiments also reveal DNA binding sequence- 
specificity. The DNA binding sequence specificity of poiyamides which differ by a single 

10 Py/Py, Hp/Py, or Py/Hp pair for sites which differ by a single A*T or T*A base pair are 
described in Tables 1, 6, and 7. 

Quantitative DNase I Footprint Titrations All reactions were executed in a total volume 
of 400 |iL (Brenowitz, M et al. y 1986). A polyamide stock solution or H 2 0 (for reference 

15 lanes) was added to an assay buffer containing 3*- 32 P radiolabeled restriction fragment (20,000 
cpm), affording final solution conditions of 10 mM Tris*HCl, 10 mM KC1, 10 mM MgCl 2 , 5 
mM CaCl 2 , pH 7.0, and either (i) a suitable concentration range of polyamide, or (ii) no 
polyamide (for reference lanes). The solutions were allowed to equilibrate for 24 hours at 22°C. 
Footprinting reactions were initiated by the addition of 10 nL of a stock solution of DNase I (at 

20 the appropriate concentration to give -55% intact DNA) containing 1 mM dithiothreitol and 
allowed to proceed for 7 minutes at 22°C. The reactions were stopped by the addition of 50 \xL 
of a solution containing 2.25 M NaCl, 150 mM EDTA, 23 |iM base pair calf thymus DNA, and 
0.6 mg/ml glycogen, and ethanol precipitated. The reactions were resuspended in 1 x TBE/ 
80% formamide loading buffer, denatured by heating at 85°C for 15 minutes, and placed on ice. 

25 The reaction products were separated by electrophoresis on an 8% polyacrylamide gel (5% 
crosslinking, 7 M urea) in 1 x TBE at 2000 V for 1.5 h. Gels were dried on a slab dryer and 
then exposed to a storage phosphor screen at 22°C. 

Photostimuable storage phosphor imaging plates (Kodak Storage Phosphor Screen 
30 SO230 obtained from Molecular Dynamics) were pressed flat against dried gel samples and 
exposed in the dark at 22°C for 12-24 hours. A Molecular Dynamics 400S Phosphorlmager 
was used to obtain all data from the storage screens (Johnston et al., 1990). The data were 
analyzed by performing volume integration of the target site and reference blocks using the 
ImageQuant v. 3.3 software running on a Compaq Pentium 80. 

35 

Quantitative DNase I Footprint Titration Data Analysis was performed by taking a 
background-corrected volume integration of rectangles encompassing the footprint sites and a 
reference site at which DNase I reactivity was invariant across the titration generated values for 
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the site intensities (I site ) and the reference intensity (I ref ). The apparent fractional occupancy 



(0 ap p) of the sites were calculated using the equation: 

/titt//r«f 

e w = 1 



(i) 



/•H.°//r^o 

where I site ° and I ref ° are the site and reference intensities, respectively, from a DNase I control 
lane to which no polyamide was added. 



10 



15 



20 



The ([L]tot> ©app) data were fit to a Langmuir binding isotherm (eq. 2, n=l) by 
minimizing the difference between 0 app and 9 flt , using the modified Hill equation: 

Ka n [L1 n tot 



0fit — Gmin + (Gmax - Gmin ) 



(2) 



1 + Ka n [L]"tot 

where [L, ol ] is the total polyamide concentration, is the equilibrium association constant, and 
9 mj( , and 9 

max arc the experimentally determined site saturation values when the site is 
unoccupied or saturated, respectively. The data were fit using a nonlinear least-squares fitting 
procedure of KaleidaGraph software (v. 3.0.1, Abelbeck Software) with K a , 0^, and 9 mjn as the 
adjustable parameters. The goodness of fit of the binding curve to the data points is evaluated 
by the correlation coefficient, with R > 0.97 as the criterion for an acceptable fit. Four sets of 
acceptable data were used in determining each association constant. All lanes from a gel were 
used unless a visual inspection revealed a data point to be obviously flawed relative to 
neighboring points. The data were normalized using the following equation: 

Gapp - Gmin 

Onorm = ' (3) 



Grr 



- Grr 



TABLE 6 Discrimination of 5'-TGTAA-3' and S'-TGTTA-y* 
Pair" 5'-TCTAA-3' 5'-TCTTA-3' AW 



5'-T G T [a] A- 3 ' 5'-T G t|*1a-3' 
•OCX #OCK 

Py/Py *XOOO J +XKX)0 ; 2.0 

J J 3'-A C a[t)t-5' 3*-A C A A T-5' 



K 6 = 0.014 uM 



K d = 0.007 u.M 



5*-T G T [a] A-3' 5'-T G T[*1a-3' 

•OOn 

Py/Hp T/v WfM- iK>00® J 0.36 

^'"P 3"-A C a|t|T-5' 3'-A C A I]T-5' 



iXXK)® J 
r-A c a[t)i 

/r d = 0.20uM 



K d = 0.56 uM 



5'-T G T 



A-3' 5'-T G T 



A-3' 



14 



Hp/Py +XKX>0 ; ^KKX>0 ; 

3*-A C A[tJt-5' 3*-A C aIaJt-S' 

* d = 40 uM K d = 0.28uM 

The reported equilibrium dissociation constants are the mean values 
obtained from two DNase I footprint titration experiments on a 3' E P 
labeled 370-bp pDEHl EcoRl/PuwII DNA restriction fragment 15 . The 
assays were carried out at 22 °C, pH 7.0 in the presence of 10 mM 
Tris»HCl, 10 mM KC1, 10 mM MgCh, and 5 mM CaCl 2 . 
tRing pairing opposite T»A and A«T in the third position 
tCalculated as KJ5'-TGTAA-y)/KJ5'-TGTVA-y). 
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TABLE 7 Discrimination of 5'-TGTTT-3' and 5'-TGATT-3" 



Pairt 



Py/Py 



5'-TGATT-3' 



5'-TGTTT-3' 




0.026 \iM 



5'-T GIT T-3* 

+>ooC<y 

3'-A CIS|A A-5' 
AT d = 0.005 uM 



5.2 



Hp/Py 



S'-T GijT T-3* 
wEOn 

3--A cgjA A-5' 



5'-T Gfilx T-3' 

■6KH5CO' 

3'-A C LA A A-5 1 




The reported equilibrium dissociation constants are the mean values 
obtained from two DNase I footprint titration experiments. The assays 
were carried out at 22 °C, pH 7.0 in the presence of 10 mM Tris-HCl, 
10 mM KC1, 10 mM MgCI 2 , and 5 mM CaCK 
TRing pairing opposite T»A and A«T in the third position. 
♦Calculated as K d (5'-TGATT -3')/ K d (5'-TCTTT-3'). 



EXAMPLE 5: 

PREPARATION OF A BIFUNCTIONAL Hp-Py-Im-POLYAMIDE. 

ImImOpPy-y-lmPyPyPy-$-Dp-NH2. ImlmOpPy-y-ImPyPyPy-p-Pam-Resin was 
synthesized in a stepwise fashion by machine-assisted solid phase methods from Boc-p-Pam- 
Resin (0.66 mmol/g). Baird, E. E. & Dervan, P. B. describes the solid phase synthesis of 
polyamides containing imidazole and pyrrole amino acids. J. Am. Chem. Soc. 118, 6141-6146 
(1996); also see PCT US 97/003332. 3-hydroxypyrrole-Boc-amino acid (0.7 mmol) was 
incorporated by placing the amino acid (0.5 mmol) and HBTU (0.5 mmol) in a machine 
synthesis cartridge. Upon automated delivery of DMF (2 mL) and DIEA (1 mL) activation 
occurs. A sample of ImlmOpPy-y-ImPyPyPy-p-Pam-Resin (400 mg, 0.40 mmol/gram) was 
placed in a glass 20 mL peptide synthesis vessel and treated with neat S^'-diamino-AT- 
methyldipropylamine (2 mL) and heated (55 °C) with periodic agitation for 16 h. The reaction 
mixture was then filtered to remove resin, 0.1% (wt/v) TFA added (6 mL) and the resulting 
solution purified by reversed phase HPLC. ImImOpPy-y-ImPyPyPy-p-Dp-NH2 is recovered 
upon lyophilization of the appropriate fractions as a white powder (93 mg, 46% recovery). UV 
(H20) A m ax 246, 316 (66,000); ] H NMR (DMSO-dtf) 5 10.34 (s, 1 H), 10.30 (br s, 1 H), 10.25 
(s, 1 H), 9.96 (s, 1 H), 9.95 (s, 1 H), 9.89 (s, 1 H), 9.24 (s, 1 H), 9.11 (s, 1 H), 8.08 (t, 1 H, J = 
5.6 Hz), 8.0 (m, 5 H), 7.62 (s, 1 H), 7.53 (s, 1 H), 7.42 (s, 1 H), 7.23 (d, 1H, J = 1.2 Hz), 7.21 
(m, 2 H), 7.15 (m, 2 H), 7.13 (d, 1 H), 7.1 1 (m, 2 H), 7.04 (d, 1 H), 6.84 (m, 3 H), 3.98 (s, 3 H), 
3.97 (s, 3 H), 3.92 (s, 3 H), 3.82 (m, 6 H), 3.80 (s, 3 H), 3.77 (d, 6 H), 3.35 (q, 2 R,J= 5.8 Hz) 
3.0-3.3 (m, 8 H), 2.86 (q, 2 H, J = 5.4 Hz), 2.66 (d, 3 H, J = 4.5 Hz), 2.31 (m, 4 H), 1.94 
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(quintet, 2 H, J = 6.2 Hz), 1.74 (m, 4 H); MALDI-TOF-MS (monoisotopic), 1296.0 (1296.6 
calc. for C6OH78N23O11). 

ImImOpPy-y-ImPyPyPy-$~Dp-EDTA. Excess EDTA-dianhydride (50 mg) was dissolved 
5 in DMSO/NMP (1 mL) and DIEA (1 mL) by heating at 55 °C for 5 min. The dianhydride 
solution was added to ImImOpPy-y-ImPyPyPy-P-NH2 (13 mg, 10 nmol) dissolved in DMSO 
(750 \iL). The mixture was heated (55 °C, 25 min.) and the remaining EDTA-anhydride 
hydroiyzed (0.1M NaOH, 3 mL, 55 °C, 10 min). Aqueous TFA (0.1% wt/v) was added to 
adjust the total volume to 8 mL and the solution purified directly by reversed phase HPLC to 
10 provide ImlmOpPy-y-ImPyPyPy-p-Dp-EDTA as a white powder upon lyophilization of the 
appropriate fractions (5.5 mg, 40% recovery). MALDI-TOF-MS (monoisotopic), 1570.9 
(1570.7 calc. for C70H92N25O18). 

ImImHpPy-y-ImPyPyPy-$-Dp-EDTA. In order to remove the methoxy protecting group, 
15 a sample of ImlmOpPy-y-ImPyPyPy-P-Dp-EDTA (5 mg, 3.1 ^mol) was treated with sodium 
thiophenoxide at 100 °C for 2 h. DMF (1000 \iL) and thiophenol (500 ^L) were placed in a (13 
x 100 mm) disposable Pyrex screw cap culture tube. A 60 % dispersion of sodium hydride in 
mineral oil (100 mg) was slowly added. Upon completion of the addition of the sodium hydride, 
ImlmOpPy-y-ImPyPyPy-P-Dp-EDTA (5 mg) dissolved in DMF (500 \iL) was added. The 
20 solution was agitated, and placed in a 100 °C heat block, and deprotected for 2 h. Upon 
completion of the reaction the culture tube was cooled to 0°C, and 7 ml of a 20 % (wt/v) 
solution of trifluoroacetic acid added. The aqueous layer is separated from the resulting biphasic 
solution and purified by reversed phase HPLC. ImlmHpPy-y-lmPyPyPy-P-Dp-EDTA is 
recovered as a white powder upon lyophilization of the appropriate fractions (3.2 mg, 72 % 
25 recovery). UV (H2O) X. max 246, 312 (66,000); MALDI-TOF-MS (monoisotopic), 1556.6 
(1556.7 calc. for C69H90N25O18). 

ImlmPyPy-y-lmOpPyPy-fy-Dp-imi. ImlmPyPy-y-ImOpPyPy-p-Pam-Resin was 
synthesized in a stepwise fashion by machine-assisted solid phase methods from Boc-p-Pam- 

30 Resin (0.66 mmol/g). Baird, E. E. & Dervan, P. B. describes the solid phase synthesis of 
poly amides containing imidazole and pyrrole amino acids. 1 Am. Chem, Soc. 118, 6141-6146 
(1996); also see PCT US 97/003332. 3-hydroxypyrrole-Boc-amino acid (0.7 mmol) was 
incorporated by placing the amino acid (0.5 mmol) and HBTU (0.5 mmol) in a machine 
synthesis cartridge. Upon automated delivery of DMF (2 mL) and DIEA (I mL) activation 

35 occurs. A sample of ImlmPyPy-y-ImOpPyPy-p-Pam-Resin (400 mg, 0.40 mmol/gram) was 
placed in a glass 20 mL peptide synthesis vessel and treated with neat 3,3'-diamino-Af- 
methyldipropylamine (2 mL) and heated (55 °C) with periodic agitation for 16 h. The reaction 
mixture was then filtered to remove resin, 0.1% (wt/v) TFA added (6 mL) and the resulting 
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solution purified by reversed phase HPLC. ImImPyPy-y-ImOpPyPy-(^Dp-NH2 is recovered 
upon lyophilization of the appropriate fractions as a white powder (104 mg, 54% recovery). UV 
(H2O) X raax 246, 316 (66,000); MALDI-TOF-MS (monoisotopic), 1296.6 (1296.6 calc. for 
C60H78N23OH). 

5 

ImImPyPy-y-ImOpPyPy-$-Dp-EDTA. Excess EDTA-dianhydride (50 mg) was dissolved 
in DMSO/NMP (1 mL) and DIEA (1 mL) by heating at 55 °C for 5 min. The dianhydride 
solution was added to ImImPyPy-y-ImOpPyPy-P-NH2 (13 mg, 10 nmol) dissolved in DMSO 
(750 nL). The mixture was heated (55 °C, 25 min.) and the remaining EDTA-anhydride 
10 hydrolyzed (0.1M NaOH, 3 mL, 55 °C, 10 min). Aqueous TFA (0.1% wt/v) was added to 
adjust the total volume to 8 mL and the solution purified directly by reversed phase HPLC to 
provide ImImPyPy-y-ImOpPyPy-(}-Dp-EDTA as a white powder upon lyophilization of the 
appropriate fractions (5.9 mg, 42% recovery). MALDI-TOF-MS (monoisotopic), 1570.8 
(1570.7 calc. for C70H92N25O18). 

15 

ImImPyPy-y-ImHpPyPy-$-Dp-EDTA. In order to remove the methoxy protecting group, 
a sample of ImlmPyPy-y-ImOpPyPy-p-Dp-EDTA (5 mg, 3.1 ^mol) was treated with sodium 
thiophenoxide at 100 °C for 2 h. DMF (1000 \iL) and thiophenol (500 \iL) were placed in a (13 
x 100 mm) disposable Pyrex screw cap culture tube. A 60 % dispersion of sodium hydride in 

20 mineral oil (100 mg) was slowly added. Upon completion of the addition of the sodium hydride, 
ImlmPyPy-y-ImOpPyPy-p-Dp-EDTA (5 mg) dissolved in DMF (500 \iL) was added. The 
solution was agitated, and placed in a 100 °C heat block, and deprotected for 2 h. Upon 
completion of the reaction the culture tube was cooled to 0°C, and 7 ml of a 20 % (wt/v) 
solution of trifluoroacetic acid added. The aqueous layer is separated from the resulting biphasic 

25 solution and purified by reversed phase HPLC. ImlmPyPy-y-ImHpPyPy-p-Dp-EDTA is 
recovered as a white powder upon lyophilization of the appropriate fractions (3.2 mg, 72 % 
recovery). UV (H2O) X max 246, 312 (66,000); MALDI-TOF-MS (monoisotopic), 1555.9 
(1556.7 calc. for C69H90N25O18). 

30 EXAMPLE 6: 

DETERMINATION OF POLYAMIDE BINDING ORIENTATION 

Affinity cleavage experiments using hairpin polyamides modified with EDTA-Fe(II) at 
either the C-terminus or on the y-turn, were used to determine polyamide binding orientation 
35 and stoichiometry. The results of affinity cleavage experiments are consistent only with 
recognition of 6-bp by an 8-ring hairpin complex and rule out any extended 1:1 or overlapped 
complex formation. In addition, affinity cleavage experiments reveal hairpin formation 
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supporting the claim that it is the Hp/Py and Py/Hp pairing which form at both match and 
mismatch sites to discriminate A»T from T* A. 

Affinity cleavage reactions were executed in a total volume of 40 jiL. A stock solution of 
5 polyamide or H 2 0 was added to a solution containing labeled restriction fragment (20,000 
cpm), affording final solution conditions of 25 mM Tris- Acetate, 20 mM NaCl, 100 \iM/bp calf 
thymus DNA, and pH 7.0. Solutions were incubated for a minimum of 4 hours at 22°C. 
Subsequently, 4 \ih of freshly prepared 100 ^iM Fe(NH4) 2 (S0 4 )2 was added and the solution 
allowed to equilibrate for 20 min. Cleavage reactions were initiated by the addition of 4 \ih of 
10 100 mM dithiothreitol, allowed to proceed for 30 min at 22 °C, then stopped by the addition of 
10 of a solution containing 1.5 M NaOAc (pH 5.5), 0.28 mg/mL glycogen, and 14 jiM base 
pairs calf thymus DNA, and ethanol precipitated. The reactions were resuspended in lx 
TBE/80% formamide loading buffer, denatured by heating at 85 °C for 15 min, and placed on 
ice. The reaction products were separated by electrophoresis on an 8% polyacrylamide gel 
15 (5% cross-link, 7 M urea) in lx TBE at 2000 V for 1 .5 hours. Gels were dried and exposed to a 
storage phosphor screen. Relative cleavage intensities were determined by volume integration 
of individual cleavage bands using ImageQuant software. 
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EXAMPLE 7: 

IMPROVEMENT TO POLYAMIDE SEQUENCE SPECIFICITY. 



10 



15 



20 



The polyamide* of this invention provide improved specificity relative to existing 
polyamide technology. Turner, J. T, Baird, E. E., and Dervan, P.B. describe the recognition of 
seven base pair sequences in the minor groove of DNA by ten-ring pyrrole-imidazole 
polyamide hairpins 1 Am. Chem. Soc. 1997 7/9, 7636. For example, quantitative DNasel 
footprint titrations reveal that the 10-ring hairpin ImPyPyPyPy-y-ImPyPyPyPy-p-Dp binds a 5*- 
TGTAACA-3- sequence with an equlibrium dissociation constant of 0.083 nM, and 18-fold 
specificity versus a single base mismatch site. A number of other sites are also bound on the 
252-bp DNA fragment used for the footprint titration experiments. (Figure 13). Introduction of 
a Hp/Py and Py/Hp pair in the 10-ring polyamide, ImHpPyPyPy-y-ImHpPyPyPy-p-Dp, to 
recognize a T« A and A-T within the 7-bp target sequence, increases the sequence-specificty. For 
example, a single base mismatch site 5'-TGGAACA-3 is discriminated by > 5000-fold (Figure 
13, Table 8). In fact all 245 7-bp mismatch sites present on the restriction fragment are 
discriminated > 5000-fold by the polyamide ImHpPyPyPy-y-ImHpPyPyPy-p-Dp (Figure 13). 
For cases where three A,T base pairs are present in succession it is preferred to substitute Py/Py 
in place of at least one Hp/Py or Py/Hp to provide for recognition of A«T and T*A at a single 
position. 



TABLE 8 Equilibrium dissociation constants* 



Polyamidet 5'-TGGTCA-3' 



5'-TGGACA-3' 



5'-T oHaIcA-3* 5*-T G[G]T[A|C A-3* 

3'~A C A T T O T-5' 3'- A C tCjAgjG x-5" 
* d =1.5nM 



K d = 0.083 nM 



18 



5'-T £[J|a[Bc A-3* 5'-T o[alT|Alc A-3' 

Hp/Py -6KK>0O®#^ >5000 

3*-A C[A]T[t]G T-5' 3'-A c[cJa[tJg T-5' 

*d = 02 nM /r d >1000nM 

*The reported dissociation constants are the average values obtained from three 
DNase 1 footprint titration experiments. The standard deviation for each data set is 
less than 15% of the reported number. Assays were carried out in the presence of 10 
mM Tris'HCI, 10 mM KC1, 10 mM MgOZ and 5 mM CaC12 at pH 7.0 and 22 °C 
tRing pairing opposite T-A and A-T in the fourth position. 
^Calculated as /C d (5'-TGGTACA-3')/K d (5'-TGTAACA-3'). 



25 EXAMPLE 8: 

USE OF PAIRING CODE 



There are 256 possible four base pair combinations of A, T, G, and C. Of these, there are 
a possible 240 four base pair sequences which contain at least 1 A#T or T»A base pair and 
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therefore can advantageously use an Hp/Py, or Py/Hp carboxamide binding. Polyamides 
binding to any of these sequences can be designed in accordance with the code of TABLE 2. 
Table 9 lists the sixteen eight-ring hairpin polyamides (1-16) which recognize the sixteen 5*- 
WGTNNW-3' sequences (W = A or T, X = A, G, C, or T). Table 10 lists the sixteen eight-ring 

5 hairpin polyamides (17-32) which recognize the sixteen 5 > -WGANNW-3' sequences (17-32). 
Table 11 lists the twelve eight-ring hairpin polyamides (33-44) which recognize twelve 5'- 
WGGNNW-3' sequences which contain at least one A,T base pair. Table 1 1 lists the four eight- 
ring hairpin polyamides (G1-G4) which target the four 5*-WGGNNW-3* sequences (G1-G4) 
which contain exclusively G»C base pairs. Table 12 lists the twelve eight-ring hairpin 

10 polyamides (45-56) which recognize twelve 5 , -WGCNNW-3' sequences which contain at least 
one A,T base pair. Table 12 lists the four eight-ring hairpin polyamides (G5-G8) which target 
the four 5'-WGCNNW-3' sequences (G5-G8) which contain exclusively G«C base pairs. Table 
13 lists the sixteen eight-ring hairpin polyamides (57-72) which recognize the sixteen 5'- 
WTTNNW-3' sequences (57-72). Table 14 lists the sixteen eight-ring hairpin polyamides (73- 

15 88) which recognize the sixteen 5'-WTANNW-3' sequences (73-88). Table 15 lists the sixteen 
eight-ring hairpin polyamides (89-104) which recognize the sixteen 5'-WTGNNW-3' sequences 
(89-104). Table 16 lists the sixteen eight-ring hairpin polyamides (105-120) which recognize 
the sixteen 5 y -WTCNNW-3 y sequences (105-120). Table 17 lists the sixteen eight-ring hairpin 
polyamides (121-136) which recognize the sixteen 5'-WATNNW-3* sequences (121-136). 

20 Table 18 lists the sixteen eight-ring hairpin polyamides (137-152) which recognize the sixteen 
5'-WAANNW-3' sequences (137-152). Table 19 lists the sixteen eight-ring hairpin polyamides 
(153-168) which recognize the sixteen 5*-WAGNNW-3* sequences (153-168). Table 20 lists 
the sixteen eight-ring hairpin polyamides (169-184) which recognize the sixteen 5'-WACNNW- 
3* sequences (169-184). Table 21 lists the sixteen eight-ring hairpin polyamides (185-200) 

25 which recognize the sixteen 5'-WCTNNW-3' sequences (185-200). Table 22 lists the sixteen 
eight-ring hairpin polyamides (201-216) which recognize the sixteen 5'-WCANNW-3' 
sequences (201-216). Table 23 lists the twelve eight-ring hairpin polyamides (217-228) which 
recognize the twelve 5'-WCGNNW-3* sequences which contain at least one A,T base pair. 
Table 23 lists the four eight-ring hairpin polyamides (G9-G12) which target the four 5'- 

30 WCGNNW-3' sequences (G9-G12) which contain exclusively OG base pairs. Table 24 lists 
the twelve eight-ring hairpin polyamides (229-240) which recognize the twelve 5'-WCCNNW- 
3' sequences which contain at least one A,T base pair. Table 24 lists the four eight-ring hairpin 
polyamides (G13-G16) which target the four 5^WCCNNW-3' sequences (G13-G16) which 
contain exclusively OG base pairs. 
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TABLE 9: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WGTNNW-3* 
5 DNA sequence aromatic amino acid sequence 



1) 


5 " 


-W 


G 


T 


T 


T 


W-3 • 


1) IrnHnHDHo -Y- PvPvPvPv 


2) 


5' 


-W 


G 


T 


T 


A 


W-3' 


2 ) ImHpHp Py - y - Hp Py P yPy 


3) 


5' 


-w 


G 


T 


T 


G 


W-3 1 


3) ImHpHpIm-y- PyPyPyPy 


4) 


5' 


-w 


G 


T 


T 


C 


W-3' 


4 ) ImHpHpPy -y- ImPyPyPy 


5) 


5' 


-w 


G 


T 


A 


T 


W-3 1 


5) ImHpPyHp-y-PyHpPyPy 


6) 


5 1 


-w 


G 


T 


A 


A 


W-3' 


6) ImHpPyPy-y-HpHpPyPy 


7) 


5' 


-w 


G 


T 


A 


G 


W-3 " 


7) ImHpPy Im-y- PyHpPyPy 


8) 


5' 


-w 


G 


T 


A 


C 


W-3 1 


8) ImHpPyPy-y- ImHpPyPy 


9) 


5 1 


-w 


G 


T 


G 


T 


W-3 1 


9) ItnHpImHp -y- PyPyPyPy 


10) 


5« 


-w 


G 


T 


G 


A 


W-3' 


10) ImHpImPy-y-HpPyPyPy 


ID 


5« 


-w 


G 


T 


G 


G 


W-3' 


11) ImHpImlm-y- PyPyPyPy 


12) 


5 


-w 


G 


T 


G 


C 


W-3 1 


12 ) ImHpImPy-y- ImPyPyPy 


13) 


5 


-w 


G 


T 


C 


T 


W-3 ' 


13) ImHpPyHp-y-PylmPyPy 


14) 


5 


• -w 


G 


T 


C 


A 


W-3' 


14) ImHpPyPy-y-HpImPyPy 


15) 


5 


» -w 


G 


T 


C 


G 


W-3' 


15) ImHpPylm-y-PylmPyPy 


16) 


5 


1 -w 


G 


T 


C 


C 


W-3' 


16) ImHpPyPy-y-ImlmPyPy 
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TAB LE 10: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WGANNW-3' 
DNA sequence aromatic amino acid sequence 



17) 5 ' -W G A T T W-3' 17) ImPyHpHp-y-PyPyHpPy 

18) 5 1 -W G A T A W-3' 18) ImPyHpPy-y-HpPyHpPy 

19) 5 1 -W G A T G W-3' 19) ImPyHpIm-y-PyPyHpPy 

20) 5 1 -W G A T C W-3» 20) ImPyHpPy-y-ImPyHpPy 

21) 5 1 -W G A A T W-3' 21) ImPyPyHp-y-PyHpHpPy 

22) 5'-W G A A A W-3 1 22) ImPyPyPy-y-HpHpHpPy 

23) 5»-W G A A G W-3» 23) ImPyPylm-y-PyHpHpPy 

24) 5 1 -W G A A C W-3 1 24) ImPyPyPy-y-ImHpHpPy 

25) 5 1 -W G A G T W-3 1 25) ImPylmHp-y-PyPyHpPy 

26) 5 1 - W G A G A W-3 1 26) ImPylmPy-y-HpPyHpPy 

27) 5 1 -W G A G G W-3 1 27) ImPylmlm-y-PyPyHpPy 

28) 5 1 -W G A G C W-3 1 28) ImPylmPy-y-ImPyHpPy 

29) 5'-W G A C T W-3' 29) ImPyPyHp-y-PylmHpPy 

30) 5'-W G A C A W-3 1 30) ImPyPyPy-y-HpImHpPy 

31) 5 1 -W G A C G W-3' 31) ImPyPylm-y-PylmHpPy 

32) 5 1 -W G A C C W-3 1 32 ) ImPyPyPy-y- ImlmHpPy 
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TABLE 1 1 : 8-ring Hairpin Polyamides for recognition of 6-bp 



DNA sequence aromatic amino acid sequence 



5 


33) 


5' 


-W 


G 


G 


T 


T 


W-3' 


33) ImlmHpHp-y-PyPyPyPy 




34) 


5« 


-W 


G 


G 


T 


A 


W-3« 


34) ImlmHpPy-y-HpPyPyPy 


10 


35) 


5' 


-W 


G 


G 


T 


G 


W-3' 


35) ImlmHpIm-y-PyPyPyPy 




36) 


5« 


-W 


G 


G 


T 


C 


W-3' 


36) ImlmHpPy-y- ImPyPyPy 




37) 


5« 


-w 


G 


G 


A 


T 


W-3' 


37) ImlmPyHp-y-PyHpPyPy 


15 


38) 


5« 


-w 


G 


G 


A 


A 


W-3» 


38) ImlmPyPy^y-HpHpPyPy 




39) 


5' 


-w 


G 


G 


A 


G 


W-3» 


3 9) ImlmPylm-y-PyHpPyPy 


in 
zu 


40) 


5' 


-w 


G 


G 


A 


C 


W-3' 


40) ImlmPyPy-y-ImHpPyPy 




41) 


5' 


-w 


G 


G 


G 


T 


W-3* 


41) ImlmlmHp-y-PyPyPyPy 




42) 


5' 


-w 


G 


G 


G 


A 


W-3» 


42) ImlmlmPy-y-HpPyPyPy 


25 


43) 


5' 


-w 


G 


G 


C 


T 


W-3' 


43) ImlmPyHp-y-PylmPyPy 




44) 


5' 


-w 


G 


G 


C 


A 


W-3' 


44) ImlmPyPy-y-HpImPyPy 


30 


Gl) 


5' 


-w 


G 


G 


G 


G 


W-3' 


Gl) Imlmlmlm-y- PyPyPyPy 




62) 


5' 


-w 


G 


G 


G 


C 


W-3' 


G2) ImlmlmPy-y-ImPyPyPy 




G3) 


5' 


-w 


G 


G 


C 


G 


W-3' 


G3) ImlmPylm-y-PylmPyPy 


35 


G4) 


5 


-w 


G 


G 


C 


C 


W-3' 


G4) ImlmPyPy-y-ImlmPyPy 
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TABLE 12: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WGCNNW-3* 

— ~™ DNA seq uence aromatic amino acid sequence 

45) 5 1 -W G C T T W-3 1 45) ImPyHpHp-y-PyPylmPy 



46) 5»-W G £ T A W-3 1 46) ImPyHpPy-y-HpPylmPy 

47) 5 ' -W G C T G W-3» 47) ImPyHpIm-y-PyPylmPy 

48) 5 1 -W G C T C W-3 1 48) ImPyHpPy-y-ImPylmPy 

49) 5 1 -W G C A T W-3 1 49) ImPyPyHp-y-PyHpImPy 

50) 5 1 -W G C A A W-3 1 50) ImPyPyPy-y-HpHpImPy 

51) 5 1 -W G C A G W-3» 51) ImPyPylm-y-PyHplmPy 

52) 5 1 -W G C A C W-3' 52) ImPyPyPy-y- ImHpImPy 

53) 5'-W G C G T W-3 1 53) ImPylmHp-y-PyPylmPy 

54) 5 1 -W G C G A W-3 1 54) ImPylmPy-y-HpPylmPy 

55) 5 • -W G C C T W-3 1 55) ImPyPyHp-y-PylmlmPy 

56) 5'-W G C C A W-3» 56) ImPyPyPy-y-HpImlmPy 
G5) 5 1 -W G C G G W-3 1 G5) ImPylmlm-y-PyPylmPy 
G6) 5 1 -W G C G C W-3' G6) ImPylmPy-y-ImPylmPy 
G7) 5 1 -W G C C G W-3» G7) ImPyPylm-y-PylmlmPy 
G8) 5'-W G C C C W-3' G8) ImPyPyPy-y- ImlmlmPy 
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TABLE 13: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WTTNNW-3* 
DNA sequence aromatic amino acid sequence 

57) 5'-W T T T T W-3 " * * " ™~ — — ■ 

58) 5»-W T T T A W-3 

59) 5'-W T T T G W-3 

60) 5 1 -W T T T C W-3 

61) 5 1 -W T T A T W-3 

62) 5 ' -W T T A A W-3 

63) 5'-W T T A G W-3 
.64) 5 1 -W T T A C W-3 

65) 5»-W T T G T W-3 

66) 5 1 -W T T G A W-3' 

67) 5«-W T T G G W-3' 

68) 5'-W T T G C W-3' 

69) 5'-W T T C T W-3 1 

70) 5--WT T C A W-3 1 

71) 5'-W T T C G W-3» 

72) 5'-W T T C C W-3 1 



10 



15 



20 



25 



30 



35 



5 7 ) HpHpHpHp - y- PyPyPyPy 

58) HpHpHpPy-y-HpPyPyPy 

59) HpHpHpIm-y-PyPyPyPy 

60 ) HpHpHpPy-Y- ImPyPyPy 

6 1 ) HpHpPyHp - y - PyHpPyPy 

62) HpHpPyPy-Y-HpHpPyPy 

63 ) HpHpPylm-Y-PyHpPyPy 

64 ) HpHpPyPy-y- ImHpPyPy 

6 5 ) HpHp imHp -y-PyPy Py Py 

66 ) HpHpImPy-y-HpPyPyPy 

67) HpHpImlm-y-PyPyPyPy 

68) HpHpImPy-y-ImPyPyPy 

6 9 ) HpHpPyHp - Y - Py ImPyPy 

7 0 ) HpHp Py Py - y " Hp I mPy Py 

71) HpHpPylm-Y-PylmPyPy 

72 ) HpHpPyPy-y- ImlmPyPy 
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TABLE 14: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WTANNW-3' 
— pi^^ sequence aromatic amino acid sequence 

73) 5 1 -W T A T T W-3' 73) HpPyHpHp-y-PyPyHpPy 

74) 5 1 -W T "A T A W-3' 74) HpPyHpPy-y-HpPyHpPy 

75) 5 1 -W T A T G W-3' 75) HpPyHpIm-y-PyPyHpPy 

76) 5 1 -W T A T C W-3» 76)HpPyHpPy-y-ImPyHpPy 

77) 5 1 -W T A A T W-3 1 77) HpPyPyHp-y-PyHpHpPy 

78) 5 1 -W T A A A W-3 1 78) HpPyPyPy-y-HpHpHpPy 

79) 5 1 -W T A A G W-3 1 7 9)HpPyPyIm-y-PyHpHpPy 

80) 5'-W T A A C W-3» 80 ) HpPyPyPy-y-ImHpHpPy 

81) 5'-W T A G T W-3 1 81)HpPyImHp-y-PyPyHpPy 

82) 5 1 -W T A G A W-3 1 82) HpPylmPy-y-HpPyHpPy 

83) 5*-W T A G G W-3» 83)HpPyImIm-y-PyPyHpPy 

84) 5 1 -W T A G C W-3 1 84)HpPyImPy-y-ImPyHpPy 

85) 5 1 -W TACT W-3» 85)HpPyPyHp-y-PyImHpPy 

86) 5 1 -W T A C A W-3 1 86) HpPyPyPy-y-HpImHpPy 

87) 5 1 -W T A C G W-3 1 87 ) HpPyPylm-y-PylmHpPy 

88) 5'-W T A C C W-3 1 88) HpPyPyPy-y- ImlmHpPy 
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TABLE 15: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WTGNNW-3' 



DNA sequence aromatic amino acid sequence 



5 


89) 


5' 


-W 


T 


G 


T 


T 


W-3' 


89) 


HpImHpHp -y- PyPyPyPy 




90) 


5' 


-W 


T 


G 


T 


A 


W-3' 


90) 


Hp I mHp Py - y - Hp Py Py Py 


10 


91) 


5' 


-w 


T 


G 


T 


G 


W-3' 


91) 


Hp ImHpIm- y - PyPyPyPy 




92) 


5' 


-w 


T 


G 


T 


C 


W-3 1 


92) 


HpImHpPy-y- ImPyPyPy 




93) 


5' 


-w 


T 


G 


A 


T 


W-3 1 


93) 


HpImPyHp - y- PyHpPyPy 


15 


94) 


5* 


-w 


T 


G 


A 


A 


W-3' 


94) 


HpImPyPy - y - HpHpPyPy 




95) 


5' 


-w 


T 


G 


A 


G 


W-3' 


95) 


HpImPylm-y- PyHpPyPy 


20 


96) 


5« 


-w 


T 


G 


A 


C 


W-3' 


96) 


Hp ImPy Py - y - 1 mHp Py Py 




97) 


5' 


-w 


T 


G 


G 


T 


W-3' 


97) 


HpImlmHp - y- PyPyPyPy 




98) 


5' 


-w 


T 


G 


G 


A 


W-3' 


98) 


Hp ImlmPy - y - HpPy Py Py 


25 


99) 


5 


-w 


T 


G 


C 


T 


W-3' 


99) 


HpImPyHp -y-PylmPyPy 




100) 


5 


-w 


T 


G 


C 


A 


W-3' 


100 


Hp ImPy Py - y - Hp I mPy Py 


30 


101) 


5 


-w 


T 


G 


G 


G 


W-3' 


101 


) Hplmlmlm-y- PyPyPyPy 




102) 


5 


1 -w 


T 


G 


G 


C 


W-3' 


102) HpImlmPy-y-ImPyPyPy 




103) 


5 


• -w 


T 


G 


C 


G 


W-3' 


103) HpImPylm-y-PylmPyPy 


35 


104) 


5 


» -w 


T 


G 


C 


C 


W-3' 


104) HpImPyPy-y- ImlmPyPy 
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TABLE 16: 8-ring Hairpin Polyamidcs for recognition of 6-bp 5'-WTCNNW-3' 
DNA sequence aromatic amino acid sequence 



105) 5*-W T C T T W-3' 

106) 5 1 -W T C T A W-3» 

107) 5 1 -W T C T G W-3« 

108) 5'-W T C T C W-3' 

109) 5 1 -W T C A T W-3» 

110) 5 • -W T C A A W-3 1 

111) 5 1 -W T C A G W-3' 

112) 5 1 -W T C A C W-3 1 

113) 5 1 -W T C G T W-3 1 

114) 5' -W T C G A W-3 1 

115) 5' -W T C C T W-3' 

116) 5 1 -W T C C A W-3* 

117) 5 1 -W T C G G W-3* 

118) 5 1 -W T C G C W-3 1 

119) 5 1 -W T C C G W-3 1 

120) 5»-Wr T C C C W-3 1 



1 0 5 ) Hp PyHpHp -y- Py Py ImPy 

106) HpPyHpPy-y-HpPylmPy 

107) HpPyHpIm-y-PyPylmPy 

108 ) HpPyHpPy-y- ImPylmPy 

109) HpPyPyHp -y - PyHpImPy 

110 ) HpPyPyPy-y-HpHpImPy 

111 ) HpPyPylm-y- PyHpImPy 

112 ) HpPyPyPy-y- ImHpImPy 

113 ) HpPylmHp-y-PyPylmPy 

114) HpPyImPy-y-HpPylmPy 

115) HpPyPyHp-y-PylmlmPy 

116) HpPyPyPy-y-HpImlmPy 

117) HpPylmlm-y-PyPylmPy 

118) HpPylmPy-y-ImPylmPy 

119) HpPyPylm-y-PylmlmPy 

120) HpPyPyPy-y-ImlmlmPy 
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TABLE 17: 8-ring Hairpin Polyamides for recognition of 6-bp 5^WATNNW-3' 
5 PNA sequence aromatic amino acid sequence 



121) 


5» 


-W 


A 


T 


T 


T 


W-3 " 


12 1 ) PyHpHpHp -y- PyPyPyHp 


122) 


5* 


-W 


A 


T 


T 


A 


W-3' 


122) Py HpHp Py-y-HpPy Py Hp 


123) 


5' 


-W 


A 


T 


T 


G 


W-3» 


123) PyHpHpIm-y-PyPyPyHp 


124) 


5» 


-W 


A 


T 


T 


C 


W-3» 


124) PyHpHpPy-y-ImPyPyHp 


125) 


5' 


-W 


A 


T 


A 


T 


W-3' 


125) PyHpPyHp-y-PyHpPyHp 


126) 


5' 


-W 


A 


T 


A 


A 


W-3 « 


126) PyHpPyPy-y-HpHpPyHp 


127) 


5' 


-W 


A 


T 


A 


G 


W-3' 


12 7 ) PyHpPylm-y- PyHpPyHp 


128) 


5' 


-W 


A 


T 


A 


C 


W-3 1 


128) PyHpPyPy-y-ImHpPyHp 


129) 


5' 


-w 


A 


T 


G 


T 


W-3' 


129) PyHpImHp-y-PyPyPyHp 


130) 


5' 


-w 


A 


T 


G 


A 


W-3» 


130) PyHpImPy-y-HpPyPyHp 


131) 


5' 


-w 


A 


T 


G 


G 


W-3' 


131) PyHpImlm-y-PyPyPyHp 


132) 


5' 


-w 


A 


T 


G 


C 


W-3' 


132) PyHpImPy-y-ImPyPyHp 


133) 


5 


-w 


A 


T 


C 


T 


W-3' 


13 3) PyHpPyHp-y-PylmPyHp 


134) 


5 


-w 


A 


T 


c 


A 


W-3' 


134) PyHpPyPy-y-HpImPyHp 


135) 


5 


-w 


A 


T 


c 


G 


W-3' 


13 5) PyHpPylm-y-PylmPyHp 


136) 


5 


-w 


A 


T 


c 


C 


W-3 • 


136) PyHpPyPy-y-ImlmPyHp 
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TABLE 1 8: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WAANNW-3' 
DNA sequence aromatic amino acid sequence 



5 


137) 


5' 


-W 


A 


A 


T 


T 


W-3' 


137) PyPyHpHp-y-PyPyHpHp 




138) 


5' 


-W 


a' 


A 


T 


A 


W-3' 


138) PyPyHpPy-y-HpPyHpHp 


10 


139) 


5» 


-W 


A 


A 


T 


G 


W-3' 


139) PyPyHpIm-y-PyPyHpHp 


140) 


5' 


-W 


A 


A 


T 


C 


W-3' 


140) PyPyHpPy-y-IraPyHpHp 




141) 


5' 


-W 


A 


A 


A 


T 


W-3' 


141) PyPyPyHp-y- PyHpHpHp 


15 


142) 


5' 


-w 


A 


A 


A 


A 


W-3' 


142) PyPyPyPy-y-HpHpHpHp 




143) 


5' 


-w 


A 


A 


A 


G 


W-3' 


143 ) PyPyPylm-y- PyHpHpHp 


20 


144) 


5» 


-w 


A 


A 


A 


C 


W-3' 


144 ) pypypypy-y- ImHpHpHp 




145) 


5 


-w 


A 


A 


G 


T 


W-3' 


145) PyPylmHp-y-PyPyHpHp 




146) 


5 


-w 


A 


A 


G 


A 


W-3' 


146) PyPylmPy-y-HpPyHpHp 


25 


147) 


5 


-w 


A 


A 


G 


G 


W-3' 


147) PyPylmlm-y-PyPyHpHp 




148) 


5 


-w 


A 


A 


G 


C 


W-3' 


14 8) PyPylmPy-y-ImPyHpHp 


30 


149) 


5 


-w 


A 


A 


C 


T 


W-3' 


14 9) PyPyPyHp-y-PylmHpHp 




150) 


5 


» -w 


A 


A 


C 


A 


W-3' 


150) PyPyPyPy-y-HpImHpHp 




151) 


5 


• -w 


A 


A 


C 


G 


W-3 1 


151) PyPyPylm-y-PylmHpHp 


35 


152) 


5 


1 -w 


A 


A 


c 


C 


W-3' 


152 ) PyPyPyPy-y- ImlmHpHp 
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TABLE 19: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WAGNNW-3' 







DNA sequence 


aromatic amino acid sequence 




153 ) 


5 1 -W A G T T W- 3 1 


153 ) PylmHpHp-y-PyPyPyHp 




154) 


5 1 -W A G T A W- 3 ' 


154) PylmHpPy-y-HpPyPyHp 


10 


155) 


5 1 -W A G T G W-3» 


155) PylmHpIm-y- PyPyPyHp 




156) 


5 • -W A G T C W-3 1 


156) PylmHpPy-y-ImPyPyHp 




13 / ) 


3 ' -W A G A T W-J* 


1 5 7 ) Py ImPyHp - y - PyHpPyHp 




■i- «J O 7 


•J n /\. vj h i> rr — J 


±d8/ pyimpypy-y-HpHppyHp 




IRQ) 


j W A vj a b r¥ — J 


T C Q \ TV. w X m TV. » T — • . TV- -T T TV. -T Y 

15 9; PylmPylm-y- PyHpPyHp 


20 


160) 


5 1 -W A G A C W- 3 • 


160) PylmPyPy-y-ImHpPyHp 




1 CI \ 
lolj 




161) PylmlmHp-y-PyPyPyHp 






D —W A b (j A W- J 1 


162) PylmlmPy-y-HpPyPyHp 


25 


163) 


5 1 -W A G C T W-3' 


163) PylmPyHp-y-PylmPyHp 




164) 


5'-W A G C A W-3 1 


164) PylmPyPy-y-HpImPyHp 


30 


165) 


5 1 -W A G G G W-3' 


165) Pylmlmlm-y-PyPyPyHp 




166) 


5 1 -W A G G C W-3 1 


166) PylmlmPy-y-ImPyPyHp 




167) 


5 1 -W A G C G W-3' 


167) PylmPylm-y-PylmPyHp 


35 


168) 


5 * -W A G C C W-3 1 


16 8) PylmPyPy-y-ImlmPyHp 
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TABLE 20: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WACNNW-3' 







DNA sequence 




aromatic amino acid sequence 


5 


169) 


5' 


-W A C 


T T 


W-3' 


169) PyPyHpHp-y-PyPylmHp 




170) 


5' 


-W A~C 


T A 


W-3' 


170) Py Py Hp Py-y-HpPyl mHp 


10 


171) 


5' 


-W A C 


T G 


W-3' 


171) PyPyHpIm-y-PyPylmHp 




172) 


5' 


-W A C 


T C 


W-3' 


172 ) PyPyHpPy-y- ImPylmHp 




173) 


5' 


-W A C 


A T 


W-3' 


173 ) PyPyPyHp-y- Py Hp I mHp 


15 


174) 


5 


-W A C 


A A 


W-3' 


174 ) PyPyPyPy-y-HpHpImHp 




175) 


5 


-W A C 


A G 


W-3' 


175) PyPyPylm-y-PyHpImHp 




176) 


5 


-W A C 


A C 


W-3' 


176) PyPyPyPy-y-ImHpImHp 




177) 


5 


-W A C 


G T 


W-3' 


177) PyPylmHp-y-PyPylmHp 




178) 


5 


-W A C 


G A 


W-3' 


178) PyPylmPy-y-HpPylmHp 


25 


179) 


5 


-W A C 


C T 


W-3 1 






180) 


5 


-W A C 


C A 


W-3' 


180) PyPyPyPy-y-HpImlmHp 


30 


181) 


5 


-W A C 


G G 


W-3' 


181) PyPylmlm-y-PyPylmHp 




182) 


5 


-W A C 


G C 


W-3' 


182 ) Pypyimpy-y- ImPylmHp 




183) 


5 


-W A C 


C G 


W-3' 


183) PyPyPylm-y-PylmlmHp 


35 


184) 


5 


• -W A C 


C C 


W-3 ' 


184) PyPyPyPy-y-lmlmlmHp 
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TABLE 21: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WCTNNW-3* 



DNA sequence aromatic amino acid sequence 



5 


185) 


5' 


-W 


c 


T 


T 


T 


W-3 ■ 


185) PyHpHpHp-y-PyPyPylm 




186) 


5« 


-W 


c 


T 


T 


A 


W-3 ' 


186) PvHDHnPv-v-H'nPvPvTm 


10 


187) 


5 1 


-W 


c 


T 


T 


G 


W-3 1 


187) PyHpHpIm-y-PyPyPylm 




188) 


5 1 


-W 


c 


T 


T 


C 


W-3 1 


188) PyHpHpPy-y-ImPyPylm 




189) 


5 1 


-W 


c 


T 


A 


T 


W-3 » 


189) PyHpPyHp-y-PyHpPylm 


15 


190) 


5' 


-w 


c 


T 


A 


A 


W-3' 


190) PyHpPyPy-y-HpHpPylm 




191) 


5 1 


-w 


c 


T 


A 


G 


W-3 ' 


191) PvHnPvTm -v - DvWnDvTm 
ui/ rynprjriiu f — ryn^cy j.iu 


20 


192) 


5' 


-w 


c 


T 


A 


C 


W-3 » 


192 ) PyHpPyPy -y- ImHpPylm 




193) 


5' 


-w 


c 


T 


G 


T 


W-3 1 


193)PyHpImHp-y-PyPyPyIm 




194) 


5 


-w 


c 


T 


G 


A 


W-3 1 


194 ) PyHpImPy-y-HpPyPylm 


25 


195) 


5 


-w 


c 


T 


G 


G 


W-3' 


195) PyHpImlm-y-PyPyPylm 




196) 


5 


-w 


c 


T 


G 


C 


W-3' 


196) PyHpImPy-y-ImPyPylm 


30 


197) 


5 


-w 


c 


T 


C 


T 


W-3 1 


197) PyHpPyHp-y-PylmPylm 




198) 


5 


-w 


c 


T 


C 


A 


W-3' 


198) PyHpPyPy-y-HpImPylm 




199) 


5 


• -w 


c 


T 


C 


G 


W-3' 


199) PyHpPylm-y-PylmPylm 


35 


200) 


5 


• -w 


c 


T 


C 


C 


W-3' 


200) PyHpPyPy-y-ImlmPylm 
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TABLE 22: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WCANNW-3 ? 









DNA sequence 


aromatic amino acid sequence 


5 


201) 


5 


■ -W C A T T W-3' 


2 0 1 ) PyPyHpHp-y- PyPyHpIm 




202) 


5 


■ -W CTA T A W-3» 


202) PyPyHpPy-y-HpPyHpIm 


10 


203) 


5 


1 -W C A T G W-3 1 


203) PyPYHpIm-y-PyPyHpIm 




204) 


5 


-W C A T C W-3' 


204 ) PyPyHpPy-y- ImPyHpIm 




205) 


5 


-W C A A T W-3 1 


205) PyPy PyHp-y-PyHpHpIm 


15 


206) 


5 


-W C A A A W-3 1 


206) PyPyPyPy-y-HpHpHpIm 




207) 


5 


-W C A A G W-3» 


207) Py PyPy Im -y - PyHpHp Im 


20 


208) 


5 


-W C A A C W-3» 


208) PyPyPyPy-y-lmHpHpIm 




209) 


5 


-W C A G T W-3' 


209) PyPy ImHp -y - PyPyHpIm 




210) 


5« 


-W C A G A W-3' 


210) PyPylmPy-y-HpPyHpIm 


25 


211) 


5 ' 


-W C A G G W-3 1 


211) PyPylmlm-y-PyPyHpIm 




212) 


5' 


-W C A G C W-3' 


212) PyPy ImPy-y-ImPyHpIm 


30 


213) 


5> 


-W C A C T W-3» 


213) PyPyPyHp -y - Py ImHpIm 




214) 


5» 


-W C A C A W-3' 


214) Py PyPy Py-y-HpImHpIm 




215) 


5 1 


-W C A C G W-3 1 


215) PyPy Pylm-y-PylmHpIm 


35 


216) 


5' 


-W C A C C W-3' 


216) PyPyPyPy-y-lmlmHplm 
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TABLE 23: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WCGNNW-3' 









DNA sequence 


aromatic amino acid sequence 


5 


217) 


5 


• -w 


C G T T W-3" 


217) PylmHpHp-y-PyPyPylm 




218) 


5 


' -w 


C G T A W-3» 


218) PylmHpPy-y-HpPyPylm 


10 


219) 


5 


1 -w 


C G T G W-3' 


FyimHpim-y- PyPyPy Im 


220) 


5 


• -w 


C G T C W-3 f 


220) PylmHpPy-y-ImPyPylm 




221) 


5 


• -w 


C G A T W-3 1 


221) PylmPyHp-y-PyHpPylm 


1 c 
1 J 


222 / 


5 


• -w 


C G A A W-3 1 


222) PylmPyPy-y-HpHpPylm 




223 ) 


5 


1 -w 


C G A G W-3 1 


223) PylmPylm-y-PyHpPylm 


20 


224) 


5 




C G A C W-3 1 


224 ) PvTmPvPv - V - TmHnPvTm 




225) 


5 


' -w 


C G G T W-3» 


225) PylmlmHp-y-PyPyPylm 




226) 


5 


• -w 


C G G A W-3 1 


226) PylmlmPy-y-HpPyPylm 


25 


227) 


5 


• -w 


C G C T W-3' 


227) PylmPyHp-y-PylmPylm 




228) 


5' 


-W C G C A W-3' 


228) PylmPyPy-y-HpImPylm 


30 


G9) 


5' 


-W C G G G W-3< 


G9) Pylmlmlm-y-PyPyPylm 




G10) 


5' 


-W C G G C W-3* 


G10) PylmlmPy-y-ImPyPylm 




Gil) 


5' 


-W C G C G W-3 1 


Gil) PylmPylm-y-PylmPylm 


35 


G12) 


5' 


-W C G C C W-3' 


G12 ) PylmPyPy-y- ImlmPylm 
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TABLE 24: 8-ring Hairpin Polyamides for recognition of 6-bp 5'-WCCNNW-3' 







DNA sequence 




aromatic amino acid sequence 


5 


229) 


5 * -W C C T T 


W-3' 


229) PyPyHpHp-y-PyPylralm 




230) 


5» -W C-C T A 


W-3' 


23 0) PyPyHpPy-y-HpPylmlm 


10 


231) 


5 1 -W C C T G 


W-3' 


231) PyPyHpIm-y-PyPylmlm 




232) 


5 1 -W C C T C 


W-3' 


232)PyPyHpPy-y-ImPyImIm 




233) 


5'-W C C A T 


W-3' 


233) PyPyPyHp-y-PyHpImlm 


15 


234) 


5 ' -W C C A A 


W-3' 


234) PyPyPyPy-y-HpHpImlm 




235) 


5 ' -W C C A G 


W-3' 


235) PyPyPylm-Y-PyHpImlm 


20 


236) 


5 ' -W C C A C 


W-3' 


236) Py PyPyPy -y - ImHpImlm 




237) 


5 • -W C C G T 


W-3' 


237) PyPylmHp-Y-PyPylmlm 




238) 


5 1 -W C C G A 


W-3' 


238) PyPylmPy-Y-HpPylmlm 


25 


239) 


5'-W C C C T 


W-3' 


239) PyPyPyHp-Y-Pylmlmlm 




240) 


5 ' -W C C C A 


W-3' 


240) PyPyPyPy-Y-HpImlmlm 


30 


G13) 


5 1 -W C C G G 


W-3' 


G13) PyPylmlm-Y-PyPylmlm 


G14) 


5'-W C C G C 


W-3' 


G14 ) PyPy ImPy-y- ImPylmlm 




G15) 


5 • -W C C C G 


W-3' 


G15) PyPyPylm-y-Pylmlmlm 


35 


G16) 


5'-W C C C C 


W-3' 


G16) PyPyPyPy-y-Imlmlmlm 



EXAMPLE 9: 

Aliphatic/Aromatic amino acid pairing for recognition of the DNA minor groove. 

Selective placement of an aliphatic [J-alanine (P) residue paired side-by-side with either 
a pyrrole (Py) or imidazole (Im) aromatic amino acid is found to compensate for sequence 
composition effects for recognition of the minor groove of DNA by hairpin pyrrole-imidazole 
polyamides. A series of polyamides were prepared which contain pyrrole and imidazole 
aromatic amino acids, as well as y-aminobutyric acid (y) "turn" and P-alanine "spring" aliphatic 
amino acid residues. The binding affinities and specificities of these polyamides are regulated 
by the placement of paired p/p Py/p and Im/p residues. Quantitative footprint titrations 
demonstrate that replacing two Py/Py pairings in a 12-ring hairpin (6-y-6) with two Py/p 
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pairings affords 10-fold enhanced affinity and similar sequence specificity for an 8-bp target 
sequence. 

Table 25 Equilibrium association constants (M* ' ) for polyamidcs. o c 



Poly amide y 


•TGTTAACA-3' 


S'.TGTGAACA-3' 


Specificity*' 


•OOOOCK 


2 .5 x 10* 


3.9 x 10 8 


6 




1.3 x 10 9 


2.0 x 10* 


7 


•ockxxx 

HHKXKH>r 


1.7 x Hr 10 


2.7 x 10 9 


6 




UxlO 11 


2.2 x 10 9 


55 


^-<KKKXX>V 


6.6 x 10* 


2.5 x 10* 


26 


•-OOOO-CK 


4.5 x 10'° 


7.7 x 10 9 


6 


♦OOOCKX 


2.7 x 10 10 


5.7 x 10 9 


5 


•<HXKXK 

$-^><><><><K>V 


s I x 10* 


s I x 10 8 


1 



"Values reported are the mean values obtained from three DNase 1 
footprint titration experiments. * The assays were carried out at 22 °C at 
pH 7.0 in the presence of 10 mM Tris-HCl, 10 mM KC1. 10 mM MgCI 2 , 
and 5 mM CaCU. c Match site association constants and specificities 
higher than the parent hairpin are shown in boldly pe. ^Specificity is 
calculated as K a (match) / K a ( mismatch). 

5 

The 6-y-6 hairpin ImPylmPyPyPy-y-ImPyPyPyPyPy-p-Dp, which contains six 
consecutive amino acid pairings, is unable to discriminate a single-base-pair mismatch site 5'- 
TGTTAACA-3' from a 5 ' -TGTG AAC A-3 ' match site. The hairpin polyamide Im-p- 
ImPyPyPy-y-ImPyPyPy-P-Py-p-Dp binds to the 8-bp match sequence 5'-TGTGAACA-3' with 
10 an equilibrium association constant of K a = 2.4 x 10^ M~* and > 48-fold specificity versus the 
5'-TGTTAACA-3' single-base-pair mismatch site. 



51 



WO 98/37066 PCT7US98/01 006 



Table 26 Equilibrium association constants (M~ 1 ) for polyamidcs. Q * c 



Polyamide 


S'-TGTTAACA-B' 


5'-TGTGAACA-3' 


Specificity^ 




2.5 x 10 9 


3.9 x |0 8 


6 


•^OOOOs 


6.6xlO» 


2^xl0 8 


26 


•o#oocx 


5x 10 9 


5xl0 9 


1 




*5x 10 B 


2.4 x 10 10 


*48 



Values reported for 1, 5, and 10 are the mean values obtained from 
three DNase 1 footprint titration experiments. & The assays were carried 
out at 22 °C at pH 7.0 in the presence of 10 mM Tris-HCl, 10 mM KC1, 
10 mM MgCl 2 , and 5 mM CaC^. c Match sice association constants 
and specificities higher than parent hairpins are shown in 
boldtype. ^Specificity is calculated as Rematch) / K a (mismatch). 



Modeling indicates that the (3-alanine residue relaxes ligand curvature, providing for 
optimal hydrogen bond formation between the floor of the minor groove and both Im-residues 
5 within the Im-P-Im polyamide subunit. This observation provided the basis for design of a 
hairpin polyamide, Im-p-ImPy-y-Im-p-ImPy-p-Dp, which incorporates Im/p pairings to 
recognize a "problematic" 5'-GCGC-3' sequence at subnanomolar concentrations. 

Table 27 Equilibrium association constamts (NT 1 ) for p olyamides 

Polyamide 5'-TGCGCA-3' SS'-TGGCCA-S* 5*-TGGGGA-3' 



15 





< 10 7 


<10 7 


3.7xI0 9 


1.4 x 10 8 


1.1 x 10 s 



Values reported are the mean values obtained from a minimum of three 
DNase I footprint titration experiments. ^The assays were carried out at 
22 °C at pH 7.0 in the presence of 10 mM! Tris-HCl, 10 mM KC1, 10 mM 
MgCIj, and 5 mM CaCl 2 . 

10 These results identify Im/p and p/Im pairings that respectively discriminate G*C and 

OG from A-T/T*A as well as Py/p and p/Py pairings that discriminate A-T/T*A from 
G*C/OG. These aliphatic/aromatic amino acid pairings will facilitate the design of hairpin 
polyamides which recognize both a larger binding site size as well as a more diverse sequence 
repertoire. 



EXAMPLE 10: 
POLYAMIDE BIOTIN CONJUGATES 

Bifunctional conjugates prepared between sequence specific DNA binding polyamides 
and biotin are useful for a variety of applications. First, such compounds can be readily attached 
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to a variety of matrices through the strong interaction of biotin with the protein streptavidin. 
Readily available strepdavidin-derivatized matrices include magnetic beads for separations as 
well as resins for chromatography. 

A number of such polyamide-biotin conjugates have been synthesized by solid phase 
synthetic methods outlined in detail above. Following resin cleavage with a variety of diamines, 
the polyamides were reacted with various biotin carboxylic acid derivatives to yield 
Afunctional conjugates. The Afunctional conjugates were purified by HPLC and characterized 
by MALDI-TOF mass spectroscopy and *H NMR. 

The scheme for the synthesis of an exemplary biotin-polyamide conjugate is shown 

below. 




The foregoing is intended to be illustrative of the present invention, but not limiting. 
Numerous variations and modifications of the present invention may be effected without 
departing from the true spirit and scope of the invention. 
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What is claimed is: 

1. In a polyamide having at least three consecutive carboxamide pairs for 
binding to at least three DNA base pairs in the minor groove of a duplex 
DNA sequence having at least one A»T or T#A DNA base pair, the 
improvement comprising selecting a Hp/Py carboxamide pair to 
correspond to a T*A base pair in the minor groove of the duplex DNA 
sequence or selecting a Py/Hp carboxamide pair to bind to an A»T DNA 
base pair in the minor groove of the duplex DNA sequence, 

2. The polyamide of claim 1 wherein at least four consecutive carboxamide 
pairs bind to at least four DNA base pairs. 

3. The polyamide of claim 1 wherein at least five consecutive carboxamide 
pairs bind to at least five DNA base pairs. 

4. The polyamide of claim 1 wherein at least six consecutive carboxamide 
pairs bind to at least six DNA base pairs. 

5. The polyamide of claim 1 wherein the A»T or T»A base pair has a G»C 
or OG base pair on either side. 

6. The polyamide of claim 1 wherein the duplex DNA sequence is a 
regulatory sequence. 

7. The polyamide of claim 1 wherein the duplex DNA sequence is a 
promoter sequence. 

8. The polyamide of claim 1 wherein the duplex DNA sequence is a coding 
sequence. 

9. The polyamide of claim 1 wherein the duplex DNA sequence is a non- 
coding sequence. 

10. The polyamide of claim 1 wherein the binding of the carboxamide pairs 
to the DNA base pairs modulates the expression of a gene. 
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11. A composition comprising an effective amount of the polyamide of claim 
1 and a pharmologically suitable excipient. 

1 2. A diagnostic kit comprising the polyamide of claim 1 . 

13. A polyamide according to claim 1 having the formula: 

X1X2X3X4-Y-X5X6X7X8 
wherein y is -NH-CH2-CH2-CH2-CONH- hairpin linkage derived from 
y-aminobutyric acid or a chiral hairpin linkage derived from R-2,4- 
diaminobutyric acid; X4/X5, X3/X6, X2/X7, and X\/Xs represent 
carboxamide binding pairs which bind the DNA base pairs wherein at 
least one binding pair is Hp/Py or Py/Hp and the other binding pairs are 
selected from Py/Im Im/Py to correspond to the DNA base pair in the 
minor groove to be bound. 

14. The polyamide of claim 13 wherein there is at least one p-alanine in a 
non- Hp containing binding pair. 

15. The polyamide of claim 13 wherein dimethylaminopropylamide is 
covalently bound to Xi or Xs« 

1 6. A polyamide selected from those listed in Tables 9-24 as 
compounds 1 through 240. 

17. A polyamide selected from shown in Fig. 4. 
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Py Hp Im 




2 ImImPyPy^ImHpPyP>^-Dp 

o . 




• o 
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5*-T GOT 



C A-3 



•♦OCX 

3'-A C C [AJG T-5 

Py/Py with T»A 

5-T G G|*]C A-3 

3 -A C C [Xj 0 T-5 

Py/Hp with T-A 

5'-T G G fflc A-3 

3-A C C [AjG T-5 

Hp/Py with T»A 



5'-T G G [a] C A-3' 
•♦CCK 

3*-A C C IiJg T-5' 

Py/Py with A*T 



5*-T G G\£ic A-3' 

8KXK M®*) 

3'-A C CgjO T-5 1 

Py/Hp with A-T 



5'-T G GfAlC A-3* 
•♦©On 

3-A C C [tJg T-5* 

Hp/Py with A-T 
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5'-T G 




A-3* 



3* -A C A{*J*-5' 
* d = 0.20iiM 



6-Ring Hairpin Hp-Py-Iro-Polyamides 

3'-A C AgjT-S' 3'-A cfejA A-5' 

K d = 0.28*iM tf d = 0.008 >iM 




JA A-5* 
K d = 0.33^iM 



8-Ring Hairpin Hp-Py-Im-Polyamides 



5'-T G G 
3-A C C 



C A-3' 



G T-S* 




K d = 0.48 nM 



t|o T-5' 

tf d =0.83 nM 




C A-3' 



G T-5' 



* d = 2.5 nM 



10-Ring Hairpin Hp-Py-Im-Polyamides 



5-T G 
3-A ClAjT 



G T-5' 

K d = 0.2 nM 



5-T G Gfirac A-3' 

3-A C AklTlC T-5' 
* d = 5nM 
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abed 
5'-W O T » W W-3» 



3'-W C A V II W-5 



dT 



d-A 



d-C 



c-T 



C-A 



c*0 



C-C 



5-W O T T T W-3' 



5'-W O T T A W-3' 



3-W C A A A W-5* 

111* S 4 7 4 

1 ) XoHpBpBp-Y-PyPyPyPy 



5-W OTTO W-3* 



5*-W O T T C W-3* 



3'-W C A A T W-5* ; 3-W C A T "C W-5' 

1 1 > * 14 7 4 11)4 I * 7 t 

2 > iMBpapPy-T-RpPypyPy j J)ia8pBpl»-v*y*y*y*y 



3'-W C A A O W-5* 

ill* » 4 7 • 
4 ) IaHp8pPy-7-I«PyPyPy 



5-W O T A T W-3* 



5*-W O T A A W-3' 



3-W C ATA W-S* 
i a l « J « 7 t 

5 > IuBpPyBp-Y-PyBpPyPy 



5*-W O T A O W-3' 



3'-W C A T T W-5* 



5'-W Q T A C W-3* 



3-W CATC W-5 



3'-W C A T O W-5* 

1114 943ft! 1)14 J) 4 7 I ) 1 ) 4 > 4 ? 4 

$)I*BpPyPy- T -BpBpryPy < 7 ) InHppy la-j-PyBpPyPy 8 ) IaBpPypy~ T -i«HppyPy 



5-W 



5*-W O T 0 A W-3' 



5-W O T O O W-3* 



5*-W 0 T 0 C W-3* 



3-W C A C A W-5* 

1114 S 4 7 • 
9) ImHpImBp-Y-PyPyPyPy 



3*-W C A C T W-5* 3 -W C A C C W-5* 3 -W C A C O W-5* 

1294 9471 11)4 4471 13)4 1 4 7 1 

lO)I«BpWy-T>BpPyPyPy .ip IftHpIaJ»-<r-PyPyPypy 1 2 ) I«8pI»Py- T -lRPyPyPy 



5-w 



3-W C A O A W-5' 

11)4 1471 

1 3 ) IaHpPyHp- Y -Pyl npypy 



5 '~* £,££,4 w ~ 3 ' 5 '-" <» * c o w-3* 



5*-W O T C C W-3* 



3-W C A 0 C W-5* 

11)4 4 4 7 



3'-W C A O O W-5 



3*-W C A 0 T W-5' 

1 1 ) • 9474 11)4 44TI 111* 4 C 1 ■ 

l«|IMpPyPY-T-apI->YPy lMlMpPyl-T-PyWyPy ltumppypy-T-iViTypy 



FIG. 14 



WO 98/37066 



PCT/US98/01006 



15/17 



abed 
» # -W 0 A V * W-3 1 

3 • -W C T W W W-5 1 



d*A 



5'-W O A * A WO' 
3-W C T A T W-5' 



T 



d-C 



5'-W O A T t W-3' ; 

EHBOEF 

3-W C T A A W-5* ! 

Ill* S * 7 4 ' 12)4 I 4 Y 4 | 13)4 lCIt 

17)I«PyBpHf>-T-PyPyBpPy i U>IaPyBpPy- y -BpPyapPy |19)IaPyBpI»- T -pypyBpPy 



i 3* 

I 

i 



»W G A T O W-3' 

Km 

-W C T A C W-5* 



5*-W O A T C W-3' 



3*-W C T A O W-5* 

11)* 9 4 7 4 

2 o ) IsPyBpFy-T»I«PyBpPy 



S'-W O A A T W-V 



S'-W 0 A A A W-3* 



5*-W O A A O W-3' 



5-W O A A C W-3* 



3-W C T I A W-5' 3-W CII!W.$' 



3'-W C T T C W-5' 

2 1 ) liPy PyHp-y-P~yHpBpPy 22 >I i\pyPyPy-y- BpHpBppV 1 2 3 ) lipyPy li-Y-PyB*paVy 



5 -W O A C T W-3' 



3 -W C T C A W-5* 
12)4 9 4)4 
2S ) InPyIi\Hp-y-PyPyBpPy 

5 -W O A C T W-3* 



5*-W GAGA W-3' 



5 -W O A 0 O W-3' 



3'-W C T T O W-5* 

1 I ) 4 t 4 T • 

24 )IaPyPyPy-y-i*8pBppy 
5 -W O A G C W-3' 



3'-W C T C t W-5* : 3'-W CtCCM-5' 

12)1 4 4 1 • 12)4 9 4 14 

26 > I»PyX«Py- T -BpPyBpPy j 27 ) I*PyImI»-y-p y pyBpPy 



S'-W G A C A W-3* 



I 



3-W C T O A W-5 

4 4 t 4 



S'-W O A C G W-3' 



3-W C T 0 T W-5 

_1_2 I 4 14*4 



3'-W C T C G W-5* 

12)4 9 4 14 

28 >I*PylBPy-T-I»pyBpPy 
5'-W O A C C W-3* 



3 -W C T 0 C W-5 



3'-W C T G G W-5 



2»)I«PyPyBp- T -PyI«BpPy 30 ) I«PyPyPy- T . H pl«BpPy . 31 ) ItfyPyl^-y.p'y MppV 32 )I»^PyPy-y-I«I»BpPy' 



FIG.15 



WO 98/37066 



PCT/US98/01006 



16/17 



abed 
S»-W O O W ■ W-3' 



3'-W C C » W W-5» 



d«T 



d^ 



5-W OOft W-3' 



5'-W O 0 t A W-3' 



5*-W 0 0*0 W-3* 



C-T 



3-W C C A A W-5 j -■■ w w . * - w - * - - 

1114 lilt 1114 1 4 7 I 1114 « 4 1 • 1114 lilt 

33}i»i«flpap-T-PyPyPyPy 3* >lal»BpPy-Y-Bp*ypyPy 35 ) XftXaBpX»-7-ryPyPyPy 3*)X»I«Hi 



3'-W C C A * W-5 



5'-W O O T C W-3* 



3-W C C A C W-5 



3 -W C C A O W-5' 

14 14 14 



s-w o o A T w-3' 



c-A 



c-o 



c-c 



3-W C C * A 

1114 5 4 ? I 

37 ) imi«i»yep-T-i > yBpPyPy 



5-W 



3-W 




5'-W 



W-5' 



1114 »»it 
3 9 ) l4tf ■PyPy-i-BpBpPyPy 




3-W 



W-3* 



39 ) lRl«Pyl»-T-PyBpPyPy 



3'-W C C T O ¥-5* 

1114 $17 4 
« 0 ) IalmPyPy-f-ImHpPyPy 



5'-W O O C T W-3' 



5'-* 



W-3' 



5'-W 0 0 0 0 W-3' 



5-W O O O C W-3' 



3-W C C C A W-5* 
ill* • « 7 l 
4 1 ) I»IaIaBp-T-PY*yPy*y 



3 -W C C C T W-3* 
1114 4 4 7 1 

4 2 ) ltaiU«Py-t-BpPyPyPy 



3-W C C C C W-3* 

1114 1471 

gi ) laimiBiB-T-pyPypypy 



3'-W C C C O W-5' 

1114 * 4 T • 

G2 )lml*lmPY-i-lmPy?yVy 



S-W ©OCT W-3' 



5'-W O 0 C A W-3* 



5 -W O O C 0 W-3' 



5'-W 0 O C C W-3' 
E^i {^B- 

3 -W CCO A H-5* J 3'-W 0~¥ W-5 1 I'-IT C C O C »-5' 3'-W C C O O W-5* 

1114 4411 1114 S • 1 I 1114 I I T • 1 « * 4 * 4 t • 

43 )iiilmPyBp-T-PyI«PyPy *4 ) ImlmPyPy-Y-BpIaPyPy G3 )I*I»?y I»-?-Py ImPyPy 64 )iBj«PyPy. T "l«l«PyPy 



FIG. 16 



WO 98/37066 



PCT7US98/01006 



17/17 



• bed 



3»-W C 0 V V W-S» 



d-C 



5'-W O C T T W-J' | 
J-M C O A A W-3* j 

1194 4 • 7 • I 1 a ) 4 1 « 7 I 

45>i«PyepBjMr-*yryi«*y ' *MlatfyBpPy. T -BpPyI»ry 



$'-W OCT* W-3' 
HOBOx 

3'-W COAT W-$' 



5 *- W 2**« W " 3 ' J*-" O C T C W-3' 



C O A C W-5' 3-W COAO W-S* 

1 1 * 4 9 4 9 4 Ill* 1 ft T ft 

47 | XaPyBpI^-y-PyPyimPy 4 a ) I«*yBpPy. T -iVyI«ry 



C-A 



c-0 



5'-W OCAS W-3' 



3- 



S'-W 0 C A A W-3' 



I 



3"-W O C A O W-3" 



-W C O T A W-S' 

i a i • » • a t . * * j . > » i t 

<9)l«PyPyap-Y-I^lpl«Pr I 50 H»I>yPyFy-T-Bp«plaPy 



3-W C O T T W-S' 
i a a 4 i • i 



S'-W O C A C W-3' 



3-W C O T C H-5 



3*-W C O T O W-5' 



5'-W O C O T W-3* 

3*-W COCA W-5* 
i a a 4 S49t 

53 ) luTylmBp-TtytylmPy 



5*-W O C C T W-3' 



5*-W O C O A W-3* 
3'-W C O C T W-5' 

9491 

54 )IaPyJ«Py-T-Bp*yX«ry 



■934 449| 1 J 1 4 « ft T ft 

51 ) I»PyryI»-T-Py«pImPy 52 ) ImPyPyPy. y -i«flpI«Py 

5'- If O C 0 0 W-3' 5'-W O C O C W-3* 



3*-W C O C C H-5* 3'-W COCOM 
' C5 ) WylftJ »-t - ImPy C* > lipyliu>y-v iaFyXmPy 



3*-W O C C A W-3' 



5'-W O C C O W-3' S'-W O C C C W-3* 



3 f*a 1 4° °4 A "~ 5 r-»C?TfW I 3 -wFo"oc"w-5' 5'.WCO?7lf.3 

SS^I^yPyBV^I^ 1 56 )lipW^BplV>jy j C7 ) lLpy^T-^V»V C8 ) I^Vy^- y-AA/Jy 



FIG. 17 



INTERNATIONAL SEARCH REPORT 



Intt tional Application No 

PCT/US 98/01006 



A. CLASSIFICATION OF SUBJECT MATTER , . 

IPC 6 C07D207/34 C07D233/90 A61K31/415 C07D403/14 C12Q1/68 



According to International Patent Classification (IPC) or to both national classification and IPC 



B. FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 

IPC 6 C07D A61K Ct2Q 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and, where practical, search terms used) 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category * Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



Recognition of DNA 
subnanomolar 



J. W. TRAUGER ET AL: 
by designed ligands at 
concentrations" 
NATURE, 

vol. 382, no. 6591, 8 August 1996, 
pages 559-561, XP002066256 
cited in the application 
see the whole document 

S.E.SWALLEY ET AL: "Recognition of a 
5 , -(A,T)GGG(A,T)2-3' sequence in the minor 
groove of DNA by an eight-ring hairpin 
polyamide" 

JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 
vol. 118, no. 35, 4 September 1996, 
pages 8198-8206, XP002066377 
see page 8198 - page 8202 

-/- 



13-15,17 



13-15,17 



Further documents are listed in the continuation of box C 



Patent family members are listed in annex. 



* Special categories of cited documents : 

"A" document defining the general state of the art which is not 

considered to be of particular relevance 
"E" earlier document but published on or after the international 

filing date 

V document which may throw doubts on priority daim(s) or 
which is cited to establish the publication date of another 
citation or other special reason (as specified) 

"O" document referring to an oral disclosure, use, exhibition or 
other means 

"P" document published prior to the international filing date but 
later than the priority date claimed 



"T" later document published after the international filing date 
or priority date and not in conflict with the application but 
cited to understand the principle or theory underlying the 
invention 

"X" document of particular relevance; the claimed invention 
cannot be considered novel or cannot be considered to 
involve an inventive step when the document is taken alone 

"Y" document of particular relevance; the claimed invention 

cannot be considered to involve an inventive step when the 
document is combined with one or more other such docu- 
ments, such combination being obvious to a person skilled 
in the art. 

"& M document member of the same patent family 



Date of the actual completion of theinternational search 



28 May 1998 



Date of mailing of the international search report 



J 2. 06. 98 



Name and mailing address of the ISA 

European Patent Office, P.B. 5818 Patentlaan 2 
NL - 2280 HV Rijswijk 
Tel. (+31-70) 340-2040, Tx. 31 651 epo nl. 
Fax: (+31-70) 340-3016 



Authorized officer 



Voyiazoglou, D 



Form PCT/lSA/210 (second sheet) (July 1992) 



page 1 of 2 



INTERNATIONAL SEARCH REPORT 



tntt .tonal Application No 

PCT/US 98/01006 



C(Contlnuatlon) DOCUMENTS CONSIDERED TO BE RELEVANT 



Category ° Citation oi document, with indication .where appropriate, of the relevant passages 



Relevant to claim No. 



A 

P.X 



P,X 



E. B. BAIRD ET AL: "Solid phase synthesis 
of polyamides containing imidazole and 
pyrrole amino acids" 

JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 
vol. 118, no. 26, July 1996, 
pages 6141-6146, XP000674666 
cited in the application 
see page 6141 - page 6142 

M. E. PARKS ET AL: "Optimization of the 

hairpin polyamide design for recognition 

of the minor groove of ONA" 

JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 

vol. 118, no. 26, July 1996, 

pages 6147-6152, XP000674668 

see page 6147 - page 6148 

M. E. PARKS ET AL : "Recognition of 
5'-(A,T)GG(AT)2-3' sequences in the minor 
groove of DNA by hairpin polyamides" 
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 
vol. 118, no. 26, July 1996, DC US, 
pages 6153-6159, XP000674667 
see page 6153 - page 6155 

WO 96 05196 A (PHARMACIA) 22 February 1996 
see claim 1 

S. E. SWALLEY ET AL : "Discrimination of 
5'-GGGG-3\ and 5'-GGCC-3' sequences in 
the minor groove of DNA by eight-ring 
hairpin polyamides" 

JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 
vol. 119, no. 30, 30 July 1997, DC US, 
pages 6953-6961, XP002066260 
see page 6959 - page 6961 

W. L. WALKER ET AL: "Estimation of the 
DNA sequence discriminatory ability of 
hairpin-linked lexitropsins" 
PROCEEDINGS OF THE NATIONAL ACADEMY OF 
SCIENCES, U.S. A. , 
vol. 94, no. 11, May 1997, 
pages 5634-5639, XP002066261 
see table 1 



13-15,17 



13-15,17 



13-15,17 



13-15,17 
13-15,17 



13-15,17 



Form PCT/ISAA?10 (continuation of second shoot) (July 1992) 



page 2 of 2 



> 



INTERNATIONAL SEARCH REPORT 



int< tional application No. 

PCT/US 98/01006 



Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 



This International Search Report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons: 
1. [~J Claims Nos.: 

because they relate to subject matter not required to be searched by this Authority, namely: 



2. PH Claims Nos.: 1-12,16 

because they relate to parts of the International Application that do not comply with the prescribed requirements to such 

an extent that no meaningful International Search can be carried out, specifically: 

The claims are so broad that for determining the scope of a neaningful 
international search due account has been taken of rule 33.3 PCT; special 
emphasis was put on the following subject-matter: claims 13-15,17; pages 1-34; 
examples 1-8 and figures 1-11 

3. Claims Nos.: tn t 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 



Box II Observations where unity of invention is lacking (Continuation of Item 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows: 



1 . 1 I As ail required additional search fees were timely paid by the applicant, this International Search Report covers all 
' ■ searchable claims. 

2. f~J As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 

of any additional fee. 



3. I | As only some of the required additional search fees were timely paid by the applicant, this International Search Report 
' » covers only those claims for which fees were paid, specifically claims Nos. : 



4. PJ No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 



Remark on Protest | [ The additional search fees were accompanied by the applicant's protest. 

[ [ No protest accompanied the payment of additional search fees. 



Form PCT/ISA/21 0 (continuation of first sheet (1)) (July 1992) 



INTERNATIONAL SEARCH REPORT 

Information on patent family members 



Intt ional Application No 

PCT/US 98/01006 



Patent document 
cited in search report 



Publication 
date 



Patent family 
member(s) 



Publication 



WO 9605196 



22-02-1996 



AU 
AU 
CA 
CN 
EP 
FI 
HU 
JP 
NO 
NZ 
PL 
ZA 



689623 B 
3113695 A 
2172629 
1131946 
0722446 
961506 
76267 
9504039 T 
961377 A 
290404 A 
313821 A 
9506590 A 



02-04-1998 
07-03-1996 
22-02-1996 
25-09-1996 
24-07-1996 
05-06-1996 
28-07-1997 
22-04-1997 
30-05-1996 
24-04-1997 
22-07-1996 
18-03-1996 



Form PCT/ISA>210 {patenl tamily annex} <July 1992) 



This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 



Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 

□ BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 

□ FADED TEXT OR DRAWING 

□ BLURRED OR ILLEGIBLE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 



□ GRAY SCALE DOCUMENTS 

□ LINES OR MARKS ON ORIGINAL DOCUMENT 

□ REFERENCE(S) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: 



IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 



BEST AVAILABLE IMAGES 




COLOR OR BLACK AND WHITE PHOTOGRAPHS 



Page Blank (uspto) 



